N PUBLISHED UNDER THE PATENT COOraBPTTI 



t 

(12) INTERNATIONAL APPLI<5^TON PUBLISHED UNDER THE PATENT COOI^RTlON TREATY (PCT) 





(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date (10) International Publication Number 

9 August 2001 (09.08.2001) pct WO 01/57188 A2 



(51) International Patent Classification 7 : C12N 

(21) International Application Number: PCT/U SO 1/03800 

(22) International Filing Date: 5 February' 2001 (05.02.2001 ) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

09/496,914 
09/560,875 



3 February 2000 (03.02.2000) US 
27 April 2000 (27.04.2000) US 



(63) Related by continuation (CON) or continuation-in-part 
(OP) to earlier applications: 

US 09/496,914 (CI P) 

Filed on 3 February 2000 (03.02.2CXX)) 

US 09/560,875 (CIP) 

Filed on 27 April 2000 (27.04.2000) 

(71) Applicant (for all des ignated Slates except US) : H Y S EQ, 
INC. 1US/USJ; 670 Almanor Avenue, Sunnyvallc, CA 
94086 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): TANG, Y., Tom 

IUS/USJ; 4230 Ranwick Court, San Jose, CA 95 1 18 (US). 
LIU, Chenghua |CN/US]; 1 125 Ranchcro Way, #14, San 



Jose, CA 951 17 (US). DRMANAC, Radoje, T. |YU/US|; 
850 Fast Greenwich Place, Palo Alto, CA 94303 (US). 

(74) Agent: ELRIFI, Ivor, R.; Mint/, Levin, Coh, Ferris, 
Glovsky, and Popco, P.C., One Financial Center, Boston, 
MA 021 1 1 (US). 

(81) Designated States (national)'. AF, AG, AL, AM, AT, AU, 

AZ, BA, BB, BG, BR, BY, BZ, CA, CM, CN, CR, CU, CZ, 
DE, DK, DM, DZ, FF, FS, Fl, GB, GD, GF, CiM, GM, MR, 
ITU, ID, IL, IN, IS, JP, Kit, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, IT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GM, GM, 
KF, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CM, CY, DF, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SF, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two- letter codes and otfier abbreviations, refer to the "Guid- 
ance Notes on Codes and A bbrcviations " appearing at the begin- 
ning of each regular issue of the PCT Gazette, 



< 

00 
00 

id 



(54) Title: NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

O 



(57) Abstract: The present invcnlion provides novel nucleic acids, novel polypeptide sequences encoded by these nucleic acids and 



uses thereof. 



3DOCID: <WO 0157188A2_L> 



BEST AVAILABLE COPY 



■ -V 



WO 01/57188 




PCT/USO 1/03800 



NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

1. TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such 
5 polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

2. BACKGROUND 



10 lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past 
decade. The now routine hybridization cloning and expression cloning techniques clone novel 
polynucleotides "directly* 1 in the sense that they rely on information directly related to the 
discovered protein (i.e., partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 

15 "indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 
based on the presence of a now well-recognized secretory leader sequence motif, as well as 
various PCR-based or low stringency hybridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins 
that are known to have biological activity, for example, by virtue of their secreted nature in the 

20 case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 
techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in, for 
example, diagnostics, forensics, gene mapping; identification of mutations responsible for 
genetic disorders or other traits, to assess biodiversity, and to produce many other types of data 

25 and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel 
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, 
3 0 cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic 
variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more 
epitopes present on such polypeptides, as well as hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including expression 
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such 
3 5 polynucleotides and cells genetically engineered to express such polynucleotides. 
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The present invention relates to a collection or library of at least one novel nucleic acid 
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization (SBH), and in some cases, sequences obtained from one or more public databases. 
The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, 
5 diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid 

sequences are designated as SEQ ID NO: 1-1350. The polypeptides sequences are designated SEQ 
ID NO: 1351 -2700. The nucleic acids and polypeptides are provided in the Sequence Listing. In 
the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is guanine; T is 
thymine; and N is any of the four bases. In the amino acids provided in the Sequence Listing, * 

1 0 corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 
hybridize to the complement of SEQ ID NO: 1-1350 under stringent hybridization conditions; 
nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid 
sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific 

15 domain or truncation of the peptides encoded by SEQ IDNO:1-1350 . A polynucleotide 

comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ 
ID NO: 1-1350 or a degenerate variant or fragment thereof. The identifying sequence can be 100 
base pairs in length. 

The nucleic acid sequences of the present invention also include the sequence information 
20 from the nucleic acid sequences of SEQ ID NO: 1-1350. The sequence information can be a 
segment of any one of SEQ ID NO: 1 -1 350 that uniquely identifies or represents the sequence 
information of SEQ ID NO:1-1350. 

A collection as used in this application can be a collection of only one polynucleotide. The 
collection of sequence information or identifying information of each sequence can be provided on 
25 a nucleic acid array. In one embodiment, segments of sequence information is provided on a 

nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed 
to detect full-match or mismatch to the polynucleotide that contains the segment. The collection 
can also be provided in a computer-readable fonnat. 

This invention also includes the reverse or direct complement of any of the nucleic acid 
30 sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their 
reverse or direct complements) according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing 
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full-length genes, use for chromosome and gene mapping, use in the recombinant production of 
protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like. 

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1 -1 350 or novel 
segments or parts of the nucleic acids of the invention are used as primers in expression assays that 
5 are well known in the art. In a particularly preferred embodiment, the nucleic acid sequences of 
SEQ ID NO: 1 -1 350 or novel segments or parts of the nucleic acids provided herein are used in 
diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath 
et aL, Science 258:52-59 (1 992), as expressed sequence tags for physical mapping of the human 
genome. 

1 0 The isolated polynucleotides of the invention include, but are not limited to, a 

polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1 - 1 350; a 
polynucleotide comprising any of the full length protein coding sequences of SEQ ID NO: 1 - 1350; 
and a polynucleotide comprising any of the nucleotide sequences of the mature protein coding 
sequences of SEQ ID NO: 1- 1350. The polynucleotides of the present invention also include, but 
1 5 are not limited to ? a polynucleotide that hybridizes under stringent hybridization conditions to (a) 
the complement of any one of the nucleotide sequences set forth in SEQ ID NO: 1 -1 350; (b) a 
nucleotide sequence encoding any one of the amino acid sequences set forth in the Sequence Listing 
(e.g., SEQ ID NO: 1351-2700); (c) a polynucleotide which is an allelic variant of any 
polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g. 
20 orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of any of the polypeptides comprising an amino acid 
sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding 
25 full length or mature protein. Polypeptides of the invention also include polypeptides with 

biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence 
set forth in SEQ ID NO: 1-1 350; or (b) polynucleotides that hybridize to the complement of the 
polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically 
active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial 
30 equivalents" thereof (e.g., with at leastabout 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% 
amino acid sequence identity) that preferably retain biological activity are also contemplated. The 
polypeptides of the invention may be wholly or partially chemically synthesized but are preferably 
produced by recombinant means using the genetically engineered cells (e.g. host cells) of the 
invention. 

3 
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The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a 
hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 

The invention also provides host cells transformed or transfected with a polynucleotide of 
the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
under conditions permitting expression of the desired polypeptide, and purifying the polypeptide 
from the culture or from the host cells. Preferred embodiments include those in which the 
protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
techniques known to those skilled in the art of molecular biology. These techniques include use 
as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is 
largely restricted to a particular* cell or tissue type, polynucleotides of the invention can be used 
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 
using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et aL, Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
procedures and methods that are currently applied to other proteins. For example, a polypeptide 
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such 
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 
markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention and a 
pharmaceutically acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, for 
example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
expression or biological activity. 
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The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, b 
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 
identification of subjects exhibiting a predisposition to such conditions. The invention provides 

5 a method for detecting the polynucleotides of the invention in a sample, comprising contacting 
the sample with a compound that binds to and forms a complex with the polynucleotide of 
interest for a period sufficient to form the complex and under conditions sufficient to form a 
complex and detecting the complex such that if a complex is detected, the polynucleotide of 
interest is detected. The invention also provides a method for detecting tire polypeptides of the 

1 0 invention in a sample comprising contacting the sample with a compound that binds to and form 
a complex with the polypeptide under conditions and for a period sufficient to form the complex 
and detecting the formation of the complex such that if a complex is formed, the polypeptide is 
detected. 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
1 5 antibodies, and optionally quantitative standards, for carrying out methods of the invention. 
Furthermore, the invention provides methods for evaluating the efficacy of drugs, and 
monitoring the progress of patients, involved in clinical trials for the treatment of disorders as 
recited above. 

The invention also provides methods for the identification of compounds that modulate 
20 (z. e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides 
of the invention. Such methods can be utilized, for example, for the identification of compound 
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are 
not limited to, assays for identifying compounds and other substances that interact with (e.g., 
bind to) the polypeptides of the invention. The invention provides a method for identifying a 
25 compound that binds to the polypeptides of the invention comprising contacting the compound 
with a polypeptide of the invention in a cell for a time sufficient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter gene 
sequence in the cell; and detecting the complex by detecting the reporter gene sequence 
expression such that if expression of the reporter gene is detected the compound the binds to a 
30 polypeptide of the invention is identified. 

The methods of the invention also provides methods for treatment which involve the 
administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases o 
disorders as recited herein comprising administering compounds and other substances that 
35 modulate the overall activity of the target gene products. Compounds and other substances can 
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effect such modulation either on the level of target gene/protein expression or target protein 
activity. 

The polypeptides of the present invention and the polynucleotides encoding them are also 
useful for the same functions known to one of skill in the art as the polypeptides and 
5 polynucleotides to which they have homology (set forth in Table 2). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are useful for 
a variety of applications, as described herein, including use in arrays for detection. 



1 0 4. DETAILED DESCRIPTION OF THE INVENTION 

4,1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a", 
"an" and "the" include plural references unless the context clearly dictates otherwise. 

15 The term "active" refers to those forms of the polypeptide which retain the biologic 

and/or immunologic activities of any naturally occurring polypeptide. According to the 
invention, the terms "biologically active" or "biological activity" refer to a protein or peptide 
having structural, regulatory or biochemical functions of a naturally occurring molecule. 
Likewise "immunologically active" or "immunological activity" refers to the capability of the 

20 natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are engaged in 
extracellular or intracellular membrane trafficking, including the export of secretory or 
enzymatic molecules as part of a normal or disease process. 

25 The terms "complementary" or "complementarity" refer to the natural binding of 

polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded molecules 
may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 
total complementarity exists between the single stranded molecules. The degree of 

30 complementarity between the nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 
stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady 

35 and continuous source of germ cells for the production of gametes. The term "primordial germ 
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cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells 
are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 
5 not only populate the germ line and give rise to a plurality of terminally differentiated cells that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment, " EMF, means a series of nucleotides which 
modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
10 sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not limited to, promoters, and promoter modulating sequences (inducible 
elements). One class of EMFs are nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 
The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
1 5 "oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 

sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the 
sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T 
20 (U). It is contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences 
provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this 
invention may be assembled from fragments of the genome and short oligonucleotide linkers, or 
from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 
acid which is capable of being expressed in a recombinant transcriptional unit comprising 
25 regulatory elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide 
residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, 
more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and 
30 most preferably at least about 17 nucleotides. The fragment is preferably less than about 500 
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 
nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, 
preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30 
35 nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can 
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be used in polymerase chain reaction (PCR), various hybridization procedures or microarray 
procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A 
fragment or segment may uniquely identify each polynucleotide sequence of the present 
invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ 
5 IDNOs:l-1350. 

Probes may, for example, be used to determine whether specific mRJM A molecules are 
present in a ceil or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 
described by Walsh et al. (Walsh, P.S. et aL, 1 992 ? PCR Methods Appl 1 :241-250). They may 
be labeled by nick translation, KJenow fill-in reaction, PCR, or other methods well known in the 
1 0 art. Probes of the present invention, their preparation and/or labeling are elaborated in 

Sambrook, J. et aL, 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John 
Wiley & Sons, New York NY, both of which are incorporated herein by reference in their 
entirety. 

15 The nucleic acid sequences of the present invention also include the sequence 

information from the nucleic acid sequences of SEQ ID NO: 1-1 3 50. The sequence information 
can be a segment of any one of SEQ ID NO:1-1350 that uniquely identifies or represents the 
sequence information of that sequence of SEQ ID NO:1-1350. One such segment can be a 
twenty-mer nucleic acid sequence because the probability that a twenty-mer is fully matched in 

20 the human genome is 1 in 300. In the human genome, there are three billion base pairs in one set 
of chromosomes. Because 4 20 possible twenty-mers exist, there are 300 times more twenty-mers. 
than there are base pairs in a set of human chromosomes. Using the same analysis, the 
probability for a seventeen-mer to be fully matched in the human genome is approximately 1 in 
5. When these segments are used in arrays for expression studies, fifteen-mer segments can be 

25 used. The probability that the fifteen-mer is fully matched in the expressed sequences is also 

approximately one in five because expressed sequences comprise less than approximately 5% of 
the entire genome sequence. 

Similarly , when using sequence information for detecting a single mismatch, a segment can 
be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome 

30 with a single mismatch is calculated by multiplying the probability for a full match (1 -s-4 25 ) times the 
increased probability for mismatch at each nucleotide position (3 x 25). The probability that an 
eighteen mer with a single mismatch can be detected in an array for expression studies is 
approximately one in five. The probability that a twenty-mer with a single mismatch can be 
detected in a human genome is approximately one in five. 
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The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
kmino acids without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related nucleic 
acid sequences. For example, a promoter is operably associated or operably linked with a coding 
5 sequence if the promoter controls the transcription of the coding sequence. While operably 
linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic 
elements e.g. repressor genes are not contiguously linked to the coding sequence but still control 
transcription/translation of the coding sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a number of 
10 differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 
differentiation capability in comparison to a totipotent cell. 

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 
peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino 
1 5 acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more 
preferably at least about 9 amino acids and most preferably at least about 17 or more amino 
acids. The peptide preferably is not greater than about 200 amino acids, more preferably less 
than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is 
from about 5 to about 200 ammo acids. To be active, any polypeptide must have sufficient 
20 length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that 
have not been genetically engineered and specifically contemplates various polypeptides arising 
from post-translational modifications of the polypeptide including, but not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 
25 The term "translated protein coding portion" means a sequence which encodes for the full 

length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a peptide 
or protein without a signal or leader sequence. The "mature protein portion" means that portion 
of the protein which does not include a signal or leader sequence. The peptide may have been 
30 produced by processing in the cell which removes any leader/signal sequence. The mature 

protein portion may or may not include the initial methionine residue. The methionine residue 
may be removed from the protein during processing in the cell. The peptide may be produced 
synthetically or the protein may have been produced using a polynucleotide only encoding for 
the mature protein coding sequence. 
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The term "derivative" refers to polypeptides chemically modified by such techniques as 
ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer 
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 
substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur 
5 in human proteins. 

The term "variant"(or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., 
recombinant DNA techniques. Guidance in determining which amino acid residues may be 
replaced, added or deleted without abolishing activities of interest, may be found by comparing 

10 the sequence of the particular polypeptide with that of homologous peptides and minimizing the 
number of amino acid sequence changes made in regions of high homology (conserved regions) 
or by replacing amino acids with consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides may be 
synthesized or selected by making use of the "redundancy" in the genetic code. Various codon 

15 substitutions, such as the silent changes which produce various restriction sites, may be 
introduced to optimize cloning into a plasmid or viral vector or expression in a particular 
prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in 
the polypeptide or domains of other peptides added to the polypeptide to modify the properties of 
any part of the polypeptide, to change characteristics such as Hgand-binding affinities, interchain 

20 affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, i.e., conservative amino 
acid replacements. "Conservative" amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 

25 nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and 
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 

30 "deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 

amino acids. The variation allowed may be experimentally determined by systematically making 
insertions, deletions, or substitutions of amino acids in a polypeptide molecule using 
recombinant DNA techniques and assaying the resulting recombinant variants for activity. 
Alternatively, where alteration of function is desired, insertions, deletions or 

35 non-conservative alterations can be engineered to produce altered polypeptides. Such alterations 
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can, for example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover 
rate. Further, such alterations can be selected so as to generate polypeptides that are better suited 
5 for expression, scale up and the like in the host cells chosen for expression. For example, 
cysteine residues can be deleted or substituted with another amino acid residue in order to 

eliminate disulfide bridges. 

The terms "purified" or "substantially purified" as used herein denotes that the indicated 
nucleic acid or polypeptide is present in the substantial absence of other biological 
10 macromolecules, e.g. , polynucleotides, proteins., and the like. In one embodiment, the 

polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more 
preferably at least 99% by weight, of the indicated biological macromolecules present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight of less than 
1000 daltons, can be present). 
1 5 The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from 

at least one other component {e.g., nucleic acid or polypeptide) present with the nucleic acid or 
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 
20 polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant {e.g. , microbial, insect, or mammalian) 
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in 
bacterial or fungal {e.g. , yeast) expression systems. As a product, "recombinant microbial" 
25 defines a polypeptide or protein essentially free of native endogenous substances and 

unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most 
bacterial cultures, e.g., E. colt, will be free of glycosylation modifications; polypeptides or 
proteins expressed in yeast will have a glycosylation pattern in general different from those 
expressed in mammalian cells. 
30 The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 

or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can 
comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 
having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 
35 appropriate transcription initiation and termination sequences. Structural units intended for use 

u 
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in yeast or eukaryotic expression systems preferably include a leader sequence enabling 
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, it may include an amino terminal 
methionine residue. This residue may or may not be subsequently cleaved from the expressed 
5 recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably integrated 
a recombinant transcriptional unit into chromosomal DNA or carry the recombinant 
transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will 
express heterologous polypeptides or proteins upon induction of the regulatory elements linked 

10 to the DNA segment or synthetic gene to be expressed. This term also means host cells which 
have stably integrated a recombinant genetic element or elements having a regulatory role in 
gene expression, for example, promoters or enhancers. Recombinant expression systems as 
defined herein will express polypeptides or proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells 

15 can be prokaryotic or eukaryotic. 

The term "secreted" includes a protein that is transported across or through a membrane, 
including transport as a result of signal sequences in its amino acid sequence when it is 
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted 
wholly (e.g., soluble proteins) or partially (e.g.. receptors) from the cell in which they are 

20 expressed. "Secreted" proteins also include without limitation proteins that are transported 
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to 
include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, 
P.A. and Young, P.R. (1992) Cytokine 4(2): 134 -143) and factors released from damaged cells 
(e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Annu. Rev. Immunol. 

25 16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence 
may be naturally present on the polypeptides of the present invention or provided from 
heterologous protein sources by recombinant DNA techniques. 

30 The term "stringent" is used to refer to conditions that are commonly understood in the 

art as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization 
to filter-bound DNA in 0.5 M NaHP0 4 > 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 
65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent conditions (Le. y 
washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions are 

35 described herein in the examples. 
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In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for 
14-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for 20-base oligonucleotides), and 
60°C (for 23-base oligonucleotides). 
5 As used herein, "substantially equivalent" can refer both to nucleotide and amino acid 

sequences, for example a mutant sequence, that varies from a reference sequence by one or more 
substitutions, deletions, or additions, the net effect of which does not result in an adverse 
functional dissimilarity between the reference and subject sequences. Typically, such a 
substantially equivalent sequence varies from one of those listed herein by no more than about 
10 35% (/. e. , the number of individual residue substitutions, additions, and/or deletions in a 

substantially equivalent sequence, as compared to the corresponding reference sequence, divided 
by the total number of residues in the substantially equivalent sequence is about 0.35 or less). 
Such a sequence is said to have 65% sequence identity to the listed sequence. In one 
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a 
1 5 listed sequence by no more than 30% (70% sequence identity); in a variation of this 

embodiment, by no more than 25% (75% sequence identity); and in a further variation of this 
embodiment, by no more than 20% (80% sequence identity) and in a further variation of this 
embodiment, by no more than 1 0% (90% sequence identity) and in a further variation of this 
embodiment, by no more that 5% (95% sequence identity). Substantially equivalent, e.g., 
20 mutant, amino acid sequences according to the invention preferably have at least 80% sequence 
identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more 
preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably 
at least 98% identity, and most preferably at least 99% identity. Substantially equivalent 
nucleotide sequences of the invention can have lower percent sequence identities, taking into 
25 account, for example, the redundancy or degeneracy of the genetic code. Preferably, nucleotide 
sequence has at least about 65% identity, more preferably at least about 75% identity, more 
preferably at least about 80% sequence identity, more preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, and most preferably at least about 
95% identity, more preferably at least about 98% sequence identity, and most preferably at least 
30 about 99% sequence identity. For the purposes of the present invention, sequences having 

substantially equivalent biological activity and substantially equivalent expression characteristics 
are considered substantially equivalent. For the purposes of determining equivalence, truncation 
of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be 
disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J. 
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(1990) Methods Enzymol. 183:626-645). Identity between sequences can also be deLemiined by 
other methods known in the art, e.g. by varying hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of the cell 
types of an adult organism. 
5 The term "transformation" means introducing DNA into a suitable host cell so that the 

DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The 
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether 
or not any coding sequences are in fact expressed. The term "infection" refers to the introduction 
of nucleic acids into a suitable host cell by use of a virus or viral vector. 

10 As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 

which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified 
using known UMFs as a target sequence or target motif with the computer-based systems 
described below. The presence and activity of a UMF can be confirmed by attaching the 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated 

15 with an appropriate host under appropriate conditions and the uptake of the marker sequence is 
determined. As described above, a UMF will increase the frequency of uptake of a linked 
marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless the 
context dictates otherwise. 

20 

4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 
The isolated polynucleotides of the invention include a polynucleotide comprising the 
nucleotide sequences of SEQ ID NO:1-1350 ; a polynucleotide encoding any one of the peptide 

25 sequences of SEQ ID NO: 1351 -2700; and a polynucleotide comprising the nucleotide sequence 
encoding the mature protein coding sequence of the polypeptides of any one of SEQ ID 
NO: 1351 -2700. The polynucleotides of the present invention also include, but are not limited to, 
a polynucleotide that hybridizes under stringent conditions to (a) the complement of any of the 
nucleotides sequences of SEQ ID NO: 1-1 350 ; (b) nucleotide sequences encoding any one of the 

30 amino acid sequences set forth in the Sequence Listing; (c) a polynucleotide which is an allelic 
variant of any polynucleotide recited above; (d) a polynucleotide which encodes a species 
homolog of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 135 1-2700. 
Domains of interest may depend on the nature of the encoded polypeptide; e.g., domains in 

35 receptor-like polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic 
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domains, or combinations thereof; domains in immunoglobulin-like proteins include the variable 
immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and 
substrate binding domains; and domains in ligand polypeptides include receptor-binding 
domains. 

The polynucleotides of the invention include naturally occurring or wholly or partially 
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences disclosed 
herein. The corresponding genes can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include the preparation of probes or primers 
from the disclosed sequence information for identification and/or amplification of genes in 
appropriate genomic libraries or other sources of genomic materials. Further 5' and 3 1 sequence can 
be obtained using methods known in the art. For example, full length cDNA or genomic DNA that 
correspondstoanyofthepolynucleotidesofSEQIDNO.1-1350 canbe obtained by screening 
appropriate cDN A or genomic DNA libraries under suitable hybridization conditions using any of 
the polynucleotides of SEQ ID NOT-1350 or a portion thereof as a probe. Alternatively, the 
polynucleotides of SEQ ID NO:1-1350 may be used as the basis for suitable primer(s)that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

The nucleic acid sequences of the invention can be assembled from ESTs and sequences 
(including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 
representative fragment or segment information, or novel segment information for the full-length 
gene. 

The polynucleotides of the invention also provide polynucleotides including nucleotide 
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g. , at least about 65%, at least about 70%, at least about 
75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 
88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at 
least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic acid 
sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences 
of SEQ ID NO:l -1350, or complements thereof, which fragment is greater than about 5 
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most 
preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that 
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are selective for {i.e. specifically hybridize to any one of the polynucleotides of the invention) 
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can 
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in 
the same family of genes or can differentiate human genes from genes of other species, and are 
5 preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. Allelic and species 
variations can be routinely determined by comparing the sequence provided SEQ ID NO: 1 - 1 350, a 
representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95% 

1 0 identical, to SEQ ID NO : 1 - 1 3 50 with a sequence from another isolate of the same species. 

Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules 
coding for the same amino acid sequences as do the specific ORPs disclosed herein. In other words, 
in the coding region of an ORF, substitution of one codon for another codon that encodes the same 
amino acid is expressly contemplated. 

1 5 The nearest neighbor or homology result for the nucleic acids of the present invention, 

including SEQ ID NO: 1-1350, can be obtained by searching a database using an algorithm or a 
program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to 
search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1 993) and Altschul 
SJF. et al. J. Mol. Biol. 21 :403-410 (1990)). Alternatively a FASTA version 3 search against 

20 Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also 
provided by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable nucleic 
acid source from the desired species. 

25 The invention also encompasses allelic variants of the disclosed polynucleotides or 

proteins; that is. naturally-occurring alternative forms of the isolated polynucleotide which also 
encode proteins which are identical, homologous or related to that encoded by the 
polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences which 
30 encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 
native or variant polynucleotide. There are two variables in the construction of amino acid 
sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids 
encoding the amino acid sequence variants are preferably constructed by mutating the 
35 polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic 
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acid alterations can be made at sites that differ in the nucleic acids from different species 
(variable positions) or in highly conserved regions (constant regions). Sites at such locations 
will typically be modified in series, e.g., by substituting first with conservative choices (e.g., 
hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant 
5 choices (e.g. , hydrophobic amino acid to a charged amino acid), and then deletions or insertions 
may be made at the target site. Amino acid sequence deletions generally range from about 1 to 
30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid 
insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one 
hundred or more residues, as well as intrasequence insertions of single or multiple amino acid 
10 residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, 

preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal 
sequences necessary for secretion or for intracellular targeting in different host cells and 
sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein. 
In a preferred method, polynucleotides encoding the novel amino acid sequences are 
1 5 changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a 
polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 
nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the 
site of being changed. In general, the techniques of site-directed mutagenesis are well known to 
those of skill in the art and this technique is exemplified by publications such as, Edelman et al., 
20 DNA 2:183 (1983). A versatile and efficient method for producing site-specific changes in a 
polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 
(1982). PCR may also be used to create amino acid sequence variants of the novel nucleic acids. 
When small amounts of template DNA are used as starting material, primer(s) that differs 
slightly in sequence from the corresponding region in the template DNA can generate the desired 
25 amino acid variant. PCR amplification results in a population of product DNA fragments that 
differ from the polynucleotide template encoding the polypeptide at the position specified by the 
primer. The product DNA fragments replace the corresponding region in the plasmid and this 
gives a polynucleotide encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 
30 technique described in Wells et al., Gene 34:3 15 (1985); and other mutagenesis techniques well 
known in the art, such as, for example, the techniques in Sambrook et al., supra, and Cun-ent 
Protocols in Moleculw Biology, Ausubel et al. Due to the inherent degeneracy of the genetic 
code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be used in the practice of the invention for the cloning and expression 
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of these novel nucleic acids. Such DNA sequences include those which cure capable of 
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can he used 
to generate polynucleotides encoding chimeric or fusion proteins comprising one or more 
5 domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of the 
polynucleotides recited above. The polynucleotide can be DNA (genomic. cDNA, amplified, or 
synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known 
to those of skill in the art and can include, for example, methods for determining hybridization 

10 conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 
protein coding sequences corresponding to any one of SEQ ID NO: 1-1350 9 or functional 
equivalents thereof, may be used to generate recombinant DNA molecules that direct the 
expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also 

15 included are the cDNA inserts of any of the clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 
nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful 
nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., 

20 plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the 
art. Accordingly, the invention also provides a vector including a polynucleotide of the 
invention and a host cell containing the polynucleotide. In general, the vector contains an origin 
of replication functional in at least one organism, convenient restriction endonuclease sites, and a 
selectable marker for the host cell. Vectors according to the invention include expression 

25 vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell 
according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular 
organism or part of a multicellular organism. 

The present invention further provides recombinant constructs comprising a nucleic acid 
having any of the nucleotide sequences of SEQ ID NO: 1 -1 350 or a fragment thereof or any other 

30 polynucleotides of the invention. In one embodiment, the recombinant constructs of the present 
invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid having 
any of the nucleotide sequences of SEQ ID NO: 1-1350 or a fragment thereof is inserted, in a 
forward or reverse orientation. In the case of a vector comprising one of the ORFs of the' present 
invention, the vector may further comprise regulatory sequences, including for example, a 

35 promoter, operably linked to the ORF. Large .lumbers of suitable vectors and promoters are 
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known to those of skill in the art and are commercially available for generating the recombinant 
constructs of the present invention. The following vectors are provided by way of example. 
Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, P NHl6a, pNH18a, 
P NH46a (Stratagene); P Trc99A, pKK223-3, pKK233-3, P DR540, P RJT5 (Pharmacia). 
5 Eukaryotic: pWLneo, P SV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al., 
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinant^. Many 
10 suitable expression control sequences are known in the art. General methods of expressing 
recombinant proteins are also known and are exemplified in R. Kaufman, Methods in 
Enzymology 1 85, 537-566 (1990). As defined herein "operably linked" means that the isolated 
polynucleotide of the invention and an expression control sequence are situated within a vector 
or cell in such a way that the protein is expressed by a host cell which has been transformed 
1 5 (transfected) with the ligatcd polynucleotide/exprcssion control sequence. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 
transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 
20 kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of 
the appropriate vector and promoter is well within the level of ordinary skill in the art. 
Generally, recombinant expression vectors will include origins of replication and selectable 
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli 
and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct 
25 transcription of a downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3 -phosphogly cerate kinase (PGK), a-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 
assembled in appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing secretion of translated protein into the 
30 periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired 
characteristics, e.g., stabilization or simplified purification of expressed recombinant product. 
Useful expression vectors for bacterial use are constructed by inserting a structural DNA 
sequence encoding a desired protein together with suitable translation initiation and termination 
35 signals in operable reading phase with a functional promoter. The vector will comprise one or 
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more phenotypic selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus suhtilis > Salmonella typhimurium and various species 
within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be 
5 employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from commercially 
available plasmids comprising genetic elements of the well known cloning vector pBR322 
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 

10 Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural 
sequence to be expressed. Following transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced or derepressed by 
appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an 

15 additional period. Cells are typically harvested by centrifiigation, disrupted by physical or 
chemical means, and the resulting crude extract retained for further purification. . 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et al., Not. Biotech. 17:870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 

20 against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intramuscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form of 
naked DNA. 

25 4.3 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO: 1-1 350, or fragments, analogs or derivatives thereof. An "antisense" 
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
30 encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA 

molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid 
molecules are provided that comprise a sequence complementary to at least about 10, 25, 50, 
100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid 
molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID 
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NO: 135 1-2700 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ ID 
NO-.1-1350 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers 
5 to the region of the nucleotide sequence comprising codons which are tr anslated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term 
"noncoding region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions). 
10 Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g. , SEQ 

ID NO:1-1350), antisense nucleic acids of the invention can be designed according to the rules 
of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 
complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide 
that is antisense to only a portion of the coding or noncoding region of a mRNA. For example, 
15 the antisense oligonucleotide can be complementary to the region surrounding the translation 

start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis or enzymatic ligation reactions using procedures known in 
the art. For example, an antisense nucleic acid (e.g. , an antisense oligonucleotide) can be 
20 chemically synthesized using naturally occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense nucleic acid 
25 include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1 -methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
30 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymemyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacctic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouraciI, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 
35 3-(3-amino-3-N-2-carboxypropyl) uracil, (ac P 3)w, and 2,6-diaminopurine. Alternatively, the 
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antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 
5 The antisense nucleic acid molecules of the invention are typically administered to a 

subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of 

1 0 an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 

15 such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g. , 
by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the 

20 control of a strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
-a n omeric nucleic acid molecule. An -a nomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual -uni ts, the 
strands run parallel to each other (Gaultier et ah (1987) Nucleic Acids Res 15: 6625-6641). The 

25 antisense nucleic acid molecule can also comprise a 2 -o-methylribonucleotide (Inoue et ah 

(1987) Nucleic Acids Res 15: 6131-6148) or a chimeric RNA -DNA analogue (Inoue et aL (1987) 
FEBS Lett 215: 327-330). 

4.4 RIBOZYMES AND PNA MOIETIES 

30 In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 

Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit 

35 translation of a mRNA. A ribozyme having specificity for a nucleic acid of the invention can be 
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designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO:l- 
1350). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which 
the nucleotide sequence of the active site is complementary to the nucleotide sequence to be 
cleaved in a SECX-encoding mRNA. See, e.g., Cech et al. U.S. Pat. No. 4,987,071 ; and Cech et 
5 al. U.S. Pat. No. 5,1 16,742. Alternatively, SECX mRNA can be used to select a catalytic RNA 
having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al, 

(1993) Science 261:141 1-141 8. 

Alternatively, gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical 
10 structures that prevent transcription of the gene in target cells. See generally, Helene. (1991) 
Anticancer Drug Des. 6: 569-84; Helene. et al. (1992) Ann. N. Y. Acad. Sci. 660:27-36; and 
Maher (1992) Bioassays 14: 807-15. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or 
15 solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 
acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) BioorgMed 
Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g. , DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
20 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; 
Perry-O'Keefe et al. (1996) PNAS 93: 14670-675. 

PNAs of the invention can be used in therapeutic and diagnostic applications. For 
25 example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a 
gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or 
30 primers for DNA sequence and hybridization (Hyrup et al. (1996), above; Perry-O'Keefe (1996), 
above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
35 delivery known m the art. For example, PNA-DNA chimeras can be generated that may 
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combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA 
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked 
using linkers of appropriate lengths selected in terms of base stacking, number of bonds between 
the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24: 
3357-63. For example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 
5 , -(4-methox> r trityl)amino-5 l -deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5 f end of DNA (Mag et al. (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepwise mamier to produce a chimeric molecule with a 5* PNA segment and a 3' 
DNA segment (Finn et al. (1996) above). Alternatively, chimeric molecules can be synthesized 
with a 5' DNA segment and a 3' PNA segment. See, Petersen et al. (1 975) Bioorg Med Chem 
LettS: 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
peptides {e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Set U.S.A. 86:6553-6556; 
Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. W088/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. W089/10134). In addition, 
oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g., Kxol et 
aL, 1988, BioTechmques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 
5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, etc. 

4.5 HOSTS 

The present invention further provides host cells genetically engineered to contain the 
polynucleotides of the invention. For example, such host cells may contain nucleic acids of the 
invention introduced into the host cell using known transformation, transfection or infection 
methods. The present invention still further provides host cells genetically engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
with a regulatory sequence heterologous to the host cell which drives expression of the 
polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous 
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recombination) to provide increased polypeptide expression by replacing, in whole or in part, the 
naturally occurring promoter with all or part of a heterologous promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
is operativeiy linked to the encoding sequences. See, for example, PCT International Publication 
5 No. WO94/12650, PCT International Publication No. WO92/2080S, and PCT International 

Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding 
10 sequence, amplification of the marker DNA by standard selection methods results in co- 
amplification of the desired protein coding sequences in the cells. 

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
1 5 calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, 
L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of the invention, can be used in conventional manners to produce the gene 
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 
20 Any host/vector system can be used to express one or more of the ORFs of the present 

invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, 
COS cells, 293 cells, and S£9 cells, as well as prokaryotic host such as E. coll and B. subtilis. 
The most preferred cells are those which do not normally express the particular polypeptide or 
protein or which expresses the polypeptide or protein at low natural level. Mature proteins can 
25 be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins using 
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
al. 5 in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New 
30 York (1989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines capable of expressing a 
compatible vector are, for example, the CI 27, monkey COS cells, Chinese Hamster Ovary 
35 (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 
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cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived 
from in vitro culture of primary tissue, primary explants. HeLa cells, mouse L cells, BHK, 
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of 
replication, a suitable prompter and also any necessary ribosome binding sites, polyadenylation 
5 site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking 

nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide 
the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or 
10 more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 

refolding steps can be used, as necessary, in completing configuration of the mature protein. 
Finally, high performance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 



Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast 
or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or 
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 

20 strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 

strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it 
may be necessary to modify the protein produced therein, for example by phosphorylation or 
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 
attachments may be accomplished using known chemical or enzymatic methods. 

25 In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 
may be replaced by homologous recombination. As described herein, gene targeting can be used 
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 

30 gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 

regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 
negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the structure or stability 
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by 

35 targeting. These sequence include polyadenylation signals. mRNA stability elements, splice 



15 



agents. 
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sites, leader sequences for enhancing or modi fying transport or secretion properties of the 
protein, or other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 

5 gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 

enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. 
Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than 

10 the naturally occurring elements. Here, the naturally occurring sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 
the use of one or more selectable marker genes that are contiguous with the targeting DNA, 
allowing for the selection of cells in which the exogenous DNA has integrated into the host cell 
genome. The identification of the targeting event may also be facilitated by the use of one or 

15 more marker genes exhibiting the property of negative selection, such that the negatively 
selectable marker is linked to the exogenous DNA, but configured such that the negatively 
selectable marker flanks the targeting sequence, and such that a correct homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers usefiil for this purpose include the 

20 Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 

25 PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 

PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference 
herein in its entirety. 

4.6 POLYPEPTIDES OF THE INVENTION 

30 The isolated polypeptides of the invention include, but are not limited to, a polypeptide 

comprising: the amino acid sequences set forth as any one of SEQ ID NO: 135 1-2700 or an 
amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO:1-1350 or the 
corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides preferably with biological or immunological activity that are encoded by: (a) a 

35 polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO:1-1350 or (b) 
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polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NO: 1351- 
2700 or (c) polynucleotides that hybridize to the complement of the polynucleotides of either (a) 
or (b) under stringent hybridization conditions. The invention also provides biologically active 
or immunologically active variants of any of the amino acid sequences set forth as SEQ ID 
5 NO: 1351 -2700 or the corresponding full length or mature protein; and "substantial equivalents" 
thereof (e.g., with at least about 65%, at least about 70%, at least about 75%, at least about 80%, 
at least about 85%, 86%, 87%, 88%, 89%, at least about 90%, 91%, 92%, 93%, 94%, typically at 
least about 95%, 96%, 97%, more typically at least about 98%, or most typically at least about 
99% amino acid identity) that retain biological activity. Polypeptides encoded by allelic variants 
10 may have a similar, increased, or decreased activity compared to polypeptides comprising SEQ 
IDNO:1351-2700. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the protein may 
be in linear form or they may be cyclized using known methods, for example, as described in H. 

15 U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al. 3 J. Arner. 
Chem. Soc. 1 14 ? 9245-9253 (1992), both of which are incorporated herein by reference. Such 
fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 
including increasing the valency of protein binding sites. 

The present invention also provides both full-length and mature forms (for example, 

20 without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 
sequence is identified in the sequence listing by translation of the disclosed nucleotide 
sequences. The mature form of such protein may be obtained by expression of a full-length 
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form 
of the protein is also determinable from the amino acid sequence of the full-length form. Where 

25 proteins of the present invention are membrane bound, soluble forms of the proteins are also 
provided. In such forms, part or all of the regions causing the proteins to be membrane bound 
are deleted so that the proteins are fully secreted from the cell in which they are expressed. 

Protein compositions of the present invention may further comprise an acceptable carrier, 
such as a hydrophilic, e.g.. pharmaceutically acceptable, carrier. 

30 The present invention further provides isolated polypeptides encoded by the nucleic acid 

fragments of the present invention or by degenerate variants of the nucleic acid fragments of the 
present invention. By "degenerate variant" is intended nucleotide fragments which differ from a 
nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to 
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic 

35 acid fragments of the present invention are the ORFs that encode proteins. 
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A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid 
sequence can be synthesized using commercially available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary 
5 structural and/or conformational characteristics with proteins may possess biological properties 
in common therewith, including protein activity. This technique is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are useful, for 
example, in generating antibodies against the native polypeptide. Thus, they may be employed 
as biologically active or immunological substitutes for natural, purified proteins in screening of 
1 0 therapeutic compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 
cells which have been altered to express the desired polypeptide or protein. As used herein, a 
cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
1 5 which the cell normally produces at a lower level. One skilled in the art can readily adapt 
procedures for introducing and expressing either recombinant or synthetic sequences into 
eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides 
or proteins of the present invention. 

The invention also relates to methods for producing a polypeptide comprising growing a 
20 culture of host cells of the invention in a suitable culture medium, and purifying the protein from 
the cells or the culture in which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable 
expression vector that includes a polynucleotide of the invention is cultured under conditions that 
allow expression of the encoded polypeptide. The polypeptide can be recovered from the 
25 culture, conveniently from the culture medium, or from a lysate prepared from the host cells and 
further purified. Preferred embodiments include those in which the protein produced by such 
process is a full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial cells which 
naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
30 methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, 
and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and 
Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory 
35 Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that 
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retain biological/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fragments that encode specific protein 
domains. 

The purified polypeptides can be used in in vifro binding assays which are well known in 
5 the art to identify molecules which bind to the polypeptides. These molecules include but are not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist 
activity in in vivo tissue culture or animal models that are well known in the art. In brief, the 
molecules are titrated into a plurality of cell cultures or animals and then tested for either 

10 cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the peptides 
may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for SEQ ID NO:l 35 1-2700. 

15 The protein of the invention may also be expressed as a product of transgenic animals, 

e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
similar to those of purified proteins but into which modification are naturally provided or 

20 deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be 

made by those skilled in the art using known techniques. Modifications of interest in the protein 
sequences may include the alteration, substitution, replacement, insertion or deletion of a 
selected amino acid residue in the coding sequence. For example, one or more of the cysteine 
residues may be deleted or replaced with another amino acid to alter the conformation of the 

25 molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are 
well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such 
alteration, substitution, replacement, insertion or deletion retains the desired activity of the 
protein. Regions of the protein that are important for the protein function can be determined by 
various methods known in the art including the alanine-scanning method which involved 

30 systematic substitution of single or strings of amino acids with alanine, followed by testing the 
resulting alanine-containing variant for biological activity. This type of analysis determines the 
importance of the substituted amino acid(s) in biological activity. Regions of the protein that are 
important for protein function may be determined by the eMATRIX program. 



35 retain protein activity in whole or in part and are useful for screening or other immunological 



Other fragments and derivatives of the sequences of proteins which would be expected to 
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methodologies may also be easily made by those skilled in the art given the disclosures herein. 
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 
invention to suitable control sequences in one or more insect expression vectors, and employing 
5 an insect expression system. Materials and methods for baculovirus/insect cell expression 

systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif., U.S.A. 
(the MaxBat™ kit), and such methods are well known in the art, as described in Summers and 
Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by 
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present 
10 invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells under 
culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (i.e., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
15 of the protein may also include an affinity column containing agents which will bind to the 
protein; one or more column steps over such affinity resins as concanavalin A-agarose, 
heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™ ; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaffinity chromatography. 
20 Alternatively, the protein of the invention may also be expressed in a form which will 

facilitate purification. For example, it may be expressed as a fusion protein, such as those of , 
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for expression and purification of such fusion proteins are commercially available 
from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, 
25 respectively. The protein can also be tagged with an epitope and subsequently purified by using 
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 
steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other 
30 aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 
homogeneous isolated recombinant protein. The protein thus purified is substantially free of 
other mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein." 
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The polypeptides of the invention include analogs (variants). This embraces fragments, 
as well as peptides in which one or more amino acids has been deleted, inserted, or substituted. 
Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or 
modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to 
5 another moiety or moieties, e.g., targeting moiety or another therapeutic agent. Such analogs 
may exhibit improved properties such as activity and/or stability. Examples of moieties which 
may be fused to the polypeptide or an analog include, for example, targeting moieties which 
provide for the delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, 
antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well 
10 as receptor and ligands expressed on pancreatic or immune cells. Other moieties which may be 
fused to the polypeptide include therapeutic agents which are used for treatment, for example, 
immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and 
steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as 
alpha or beta interferon. 

15 

4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 
AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between the 
sequences tested. Methods to determine identity and similarity are codified in computer 

20 programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J. ? et ah, Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. 
et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic Acids Res. 
vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp. 

25 Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software (Nevill- 
Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software 
(Sonnharnmer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by 
reference) and the Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 
105-31 (1982). incorporated herein by reference). The BLAST programs are publicly available 

30 from the National Center for Biotechnology Information (NCBI) and other sources (BLAST 
Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. 
Biol. 215:403-410 (1990). 

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 
35 protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 
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another polypeptide. Within a fusion protein the polypeptide according to the invention can 
correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 
invention. In another embodiment, a fusion protein comprises at least two biologically active 
5 portions of a protein according to the invention. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the polypeptide according to the invention and the other 
polypeptide are fused in-frame to each other. The polypeptide can be fused to the N-terminus or 
C-terminus. 

For example, in one embodiment a fusion protein comprises a polypeptide according to 
1 0 the invention operably linked to the extracellular domain of a second protei n. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 
polypeptide sequences of the invention arc fused to the C-terminus of the GST (i.e. , glutathione 

S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which 
1 5 the polypeptide sequences according to the invention comprises one or more domains are fused 
to sequences derived from a member of the immunoglobulin protein family. The 
immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo. 
20 The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand. 
Inhibition of the ligand/protein interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, e.g., cancer as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be 
used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays 
25 to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand. 

A chimeric or fusion protein of the invention can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences 
are ligated together in-frame in accordance with conventional techniques, e.g., by employing 
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
30 appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can 
be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that 
give rise to complementary overhangs between two consecutive gene fragments that can 
35 subsequently be annealed and reamplified to generate a chimeric gene sequence ,see, for 

33 



■1SDOCID <WO 0157188A2_I_> 



WO 01/57188 




T/USO 1/03800 



example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John Wiley & 
Sons, 1992). Moreover, many expression vectors are commercially available that already encode 
a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the 
invention can be cloned into such an expression vector such that the fusion moiety is linked 
5 in-frame to the protein of the invention. 

4.8 GENE THERAPY 

Mutations in the polynucleotides of the invention gene may result in loss of normal 
function of the encoded protein. The invention thus provides gene therapy to restore normal 

10 activity of the polypeptides of the invention; or to treat disease states involving polypeptides of 
the invention. Delivery of a functional gene encoding polypeptides of the invention to 
appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly 
viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of 
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example. 

15 Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). For additional reviews of 
gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific 
American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of 
the nucleotides of the present invention or a gene encoding the polypeptides of the present 
invention can also be accomplished with extrachromosomal substrates (transient expression) or 

20 artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 
Alternatively, it is contemplated that in other human disease states, preventing the expression of 
or inhibiting the activity of polypeptides of the invention will be useful in treating the disease 

25 states. It is contemplated that antisense therapy or gene therapy could be applied to negatively 
regulate the expression of polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of antisense 
molecules to the nucleic acids of the present invention, their complements, or their translated RNA 
sequences, by methods known in the art. Further, the polypeptides of the present i nvention can be 

30 inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such 
as a silencer, which is tissue specific. 

The present invention still further provides cells genetically engineered in vivo to express the 
polynucleotides of the invention, wherein such polynucleotides are in operative association with a 
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in 
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the cell. These methods can be used to increase or decrease the expression of the polynucleotides of 
the present invention. 

Knowledge of DN A sequences provided by the invention allows for modification of cells to 
permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g. , by 
5 homologous recombination) to provide increasedpolypeptide expression by replacing, in whole or 
in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is 
operatively linked to the desired protein encoding sequences. See, for example, PCT International 
PublicationNo. WO 94/12650, PCT International PublicationNo. WO 92/20808, and PCT 
10 International PublicationNo. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g. 9 ada, dhfr, and the multifunctional CAD gene which 
encodes carbamy 1 phosphate synthase, aspartate transcarbamylase, and dihy droorotase) and7 or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selection methods results in 
1 5 co-amplificationof the desired protein coding sequences in the cells. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may 
be replaced by homologous recombination. As described herein, gene targeting can be used to 
20 replace a gene' s existing regulatory region with a regulatory sequence isolated from a different gene 
or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory 
sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative 
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations 
of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or 
25 protein produced may be replaced, removed, added, or otherwise modified by targeting. These 

sequences include poly adenylation signals, mRNA stability elements, splice sites, leader sequences 
for enhancing or modifying transport or secretion properties of the protein, or other sequences 
which alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 
30 under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 
element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the 
targeting event may replace an existing element; for example, a tissue-specific enhancer can be 
replaced by an enhancer that has broader or different cell-type specificity than the naturally 
3 5 occurring elements. Here, the naturally occurring sequences are deleted and new sequer ces are 
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added. In all cases, the identification of the targeting event may be facilitated by the use of one or 
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection 
of cells in which the exogenous DNA has integrated into the cell genome. The identification of the 
targeting event may also be facilitated by the use of one or more marker genes exhibiting the 
property of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and 
such that a correct homologous recombination event with sequences in the host cell genome does 
not result in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 
xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with this 
aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to Chappel; 
U.S. Patent No. 5,578,461 to Sherwin et aL; International Application No. PCT/US92/09627 
(WO93/09222)by Selden et al.; and International Application No. PCT/US 90/0643 6 
(W09 1/06667) by Skoultchi et al.,each of which is incorporated by reference herein in its entirety. 

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 
animals are useful to detennine the roles polypeptides of the invention play in biological 
processes, and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of expression 
of the polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or even 
replacing the homologous promoter to provide for increased protein expression. The homologous 
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promoter can be supplemented by insertion of one or more heterologous enhancer elements 
known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
through, e.g., homologous recombination or knock out strategies, of animals that fail to express 
5 polypeptides of the invention or that express a variant polypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as well as for studying modulators of the 
polypeptides of the invention. 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 

10 inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 

1 5 prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 
animals are useful to determine the roles polypeptides of the invention play in biological 
processes, and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 

20 Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or pari of the polynucleotides of the 
invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous recombination 
methods described above. Activation can be achieved by supplementing or even replacing the 

25 homologous promoter to provide for increased protein expression. The homologous promoter 
can be supplemented by insertion of one or more heterologous enhancer elements known to 
confer promoter activation in a particular tissue. 

4.10 USES AND BIOLOGICAL ACTIVITY 

30 The polynucleotides and proteins of the present invention are expected to exhibit one or 

more of the uses or biological activities (including those associated with assays cited herein) 
identified herein. Uses or activities described for proteins of the present invention may be 
provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The 

35 mechanism underlying the particular condition or pathology will dictate whether the 
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polypeptides of the invention, the polynucleotides of the invention or modulators (activators or 
inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, "therapeutic 
compositions of the invention' 5 include compositions comprising isolated polynucleotides 
(including recombinant DNA molecules, cloned genes and degenerate variants thereof) or 
5 polypeptides of the invention (including flill length protein, mature protein and truncations or 
domains thereof), or compounds and other substances that modulate the overall activity of the 
target gene products, either at the level of target gene/protein expression or target protein 
activity. Such modulators include polypeptides, analogs, (variants), including fragments and 
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or 

10 indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening 
assays as described herein); anti sense polynucleotides and polynucleotides suitable for triple 
helix formation; and in particular antibodies or other binding partners that specifically recognize 
one or more epitopes of the polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular activation 

15 or in one of the other physiological pathways described herein. 

4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant 

20 protein for analysis, characterization or therapeutic use; as markers for tissues in which the 

corresponding protein is preferentially expressed (either constitutively or at a particular stage of 
tissue differentiation or development or in disease states); as molecular weight markers on gels; 
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 
positions; to compare with endogenous DNA sequences in patients to identify potential genetic 

25 disorders: as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and making 
oligomers for attachment to a "gene chip" or other support, including for examination of 
expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as 

30 an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 

polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 
example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap 
assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of 

35 the binding interaction. 
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The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its 
receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is 
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or 
development or in a disease state); and, of course, to isolate correlative receptors or ligands. 
Proteins involved in these binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or 
kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art. 
References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed„ Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch 
and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning 
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional 
sources or supplements. Such uses include without limitation use as a protein or amino acid 
supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In 
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a 
particular organism or can be administered as a separate solid or liquid preparation, such as in the 
form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the 
polypeptide or polynucleotide of the invention can be added to the medium in or on which the 
microorganism is cultured. 

4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) 
activity or may induce production of other cytokines in certain cell populations. A 
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, including all known cytokines, have exhibited activity in one 
or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient 
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confirmation of cytokine activity. The activity of therapeutic compositions of the present 
invention is evidenced by any one of a number of routine factor dependent cell proliferation 
assays for cell lines including, without limitation, 32D, DA2, DA1G 9 T10, B9, B9/1 1, BaF3, 
MC9/G, M+(preB M+), 2E8, RB5, DAL 123, Tl 165, HT2, CTLL2, TF-1, Mo7e, CMK, 
5 HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 
Assays for T-cell or thymocyte proliferation include without limitation those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 

10 Humans): Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 

145:1706-1712, 1990; Bertagnolli etal., Cellular Immunology 133:327-341, 1991; Bertagnolli, 
et al., I. Immunol. 149:3778-3783, 1992; Bowman et ah, I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or 
thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 

15 Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 

eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994: and Measurement of mouse 
and human interieukin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 

20 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 
and Interleukin 4, Bottom ly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; 
deVries et al., J. Exp. Med. 173:1205-121 1, 1991; Moreau et al., Nature 336:690-692, 1988; 
Greenberger et a!., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse 

25 and human interleukin 6— Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 
1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 1 l~Bennett, F., Giannotti, L, 
Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 
6.15.1 John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 

30 9~Ciarletta : A., Giannotti, L, Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. 
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among others, proteins 
that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and 
cytokine production) include, without limitation, those described in: Current Protocols in 

35 Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, 
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Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, 
Immunologic studies in Humans); Weinberger et aL, Proc. Natl. Acad. ScL USA 77:6091-6095, 
1980; Weinberger et al., Eur. J. Tmmun. 1 1:405-411, 1981; Takai et aL, J. Immunol. 
5 137:3494-3500, 1986; Takai et aL, J. Immunol. 140:508-512, 1988. 

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor activity and 
be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem 

10 cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or 

germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or 
ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential 
state which would be useful for re-engineering damaged or diseased tissues, transplantation, 
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce 

15 large quantities of human cells has important working applications for the production of human 
proteins which currently must be obtained from non-human sources or donors, implantation of 
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases: 
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including 
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs 

20 for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines may 
be administered in combination with the polypeptide of the invention to achieve the desired 
effect, including any of the growth factors listed herein, other stem cell maintenance factors, and 
specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Fit- 

25 3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage 

inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic 
fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 

30 these cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem cells are known in the art and administration of polypeptides of the invention, 
optionally with other growth factors and/or cytokines, is expected to enhance the survival and 
proliferation of the stem cell populations. This can be accomplished by direct administration of 
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected 

35 with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 
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layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 
may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 

Stem cells themselves can be transfected with a polynucleotide of the invention to induce 
5 autocrine expression of the polypeptide of the invention. This will allow for generation of 

undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be 
differentiated into the desired mature cell types. These stable cell lines can also serve as a source 
of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for 
polymerase chain reaction experiments. These studies would allow for the isolation and 

10 identification of differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in the 
treatment of many pathological conditions. For example, polypeptides of the present invention 
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be 

15 used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or 
genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, /. e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, 

20 the expanded stem cell populations can also be genetically altered for gene therapy purposes and 
to decrease host rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell 
types. A broadly applicable method of obtaining pure populations of a specific differentiated 

25 cell type from undifferentiated stem cell populations involves the use of a cell-type specific 

promoter driving a selectable marker. The selectable marker allows only cells of the desired type 
to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (W obus 
et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) 
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 

30 Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 

accomplished by culturing the stem cells in the presence of a differentiation factor such as 
retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the 
effects of endogenous stem cell factor activity and allow differentiation to proceed. 

In vifro cultures of stem cells can be used to determine if the polypeptide of the invention 

35 exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell 
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sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder 
layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in 
the presence of the polypeptide of the invention alone or in combination with other growth 
factors or cytokines. The ability of the polypeptide of the invention to induce stem cells 
proliferation is determined by colony formation on semi-solid support e.g. as described by 
Bernstein et al., Blood, 77: 2316-2321 (1991). 

4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 
biological activity in support of colony forming cells or of factor-dependent cell lines indicates 
involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating 
utility, for example, in treating various anemias or for use in conjunction with 
irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid 
cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and 
monocytes/macrophages (i.e., traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets Ihereby allowing prevention or 
treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as 
those usually treated with transplantation, including, without limitation, aplastic anemia and 
paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 
as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al.. Molecular 
and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81 :2903-2915, 1993. 
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Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hemalopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 
Freshney, ct al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al., 
5 Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic colony forming cells 
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et 
al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 
Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
10 Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E.. Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long temi culture 
initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 

15 

4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue 
repair and replacement, and in healing of burns, incisions and ulcers. 

20 A polypeptide of the present invention which induces cartilage and/or bone growth in 

circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the invention may have 
prophylactic use in closed as well as open fracture reduction and also in the improved fixation of 

25 artificial joints. De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming cells, 
stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 

30 bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 
inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) 
mediated by inflammatory processes may also be possible using the composition of the 
invention. 

44 

O 0157188A2J_> 



WO01/57188 — *™ PCT/US0./03800 

Another category of tissue regeneration activity that may involve the polypeptide of the 
present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or 
other tissue formation in circumstances where such tissue is not normally formed, has 
application in the healing of tendon or ligament tears, deformities and other tendon or ligament 
5 defects in humans and other animals. Such a preparation employing a tendon/ligament-like 
tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament 
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation 
induced by a composition of the present invention contributes to the repair of congenital, trauma 
10 induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic 
surgery for attachment or repair of tendons or ligaments. The compositions of the present 
invention may provide environment to attract tendon- or ligament-forming cells, stimulate 
growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or 
ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for 
1 5 return in vivo to effect tissue repair. The compositions of the invention may also be useful in the 
treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The 
compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is 

well known in the art. 

The compositions of the present invention may also be useful for proliferation of neural 
20 cells and for regeneration of nerve and brain tissue, i. e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 
composition may be used in the treatment of diseases of the peripheral nervous system, such as 
peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous 
25 system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in 
accordance with the present invention include mechanical and traumatic disorders, such as spinal 
cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting from chemotherapy or other medical therapies may also be treatable using a 
30 composition of the invention. 

Compositions of the invention may also be useful to promote better or faster closure of 
non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
35 repeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, 
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kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular 
endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the 
desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue 
to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. 
5 A composition of the present invention may also be useful for gut protection or 

regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
10 growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described in: 
International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
1 5 W09 1/0749 1 (skin, endothelium). 

Assays for wound healing activity include, without limitation, those described in: Winter, 
Epidermal Wound Healing, pps. 71-1 12 (Maibach, H. I. and Rovee, D. l\, eds.), Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 
71:382-84(1978). 

20 

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 
herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 

25 protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells 
and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., 
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More 

30 specifically, infectious diseases causes by viral, bacterial, fungal or other infection may be 

treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, 
herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful 
where a boost to the immune system generally may be desirable, /. e. , in the treatment of cancer. 
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Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 
autoimmune thyroiditis, insulin dependent diabetes meUitis, myasthenia gravis, graft-versus-host 
disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof, 
including antibodies) of the present invention may also to be useful in the treatment of allergic 
reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 
venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, 
Stevens- Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
(particularly allergic asthma) or other respiratory problems. Other conditions, in which immune 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals 
models such as die cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 
1998), skin prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization 
test (Vohr et al., Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., 
J. Toxicol. Environ. Health 53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune 
responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing the induction of an immune 
response. The functions of activated T cells may be inhibited by suppressing T cell responses or 
by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is 
generally an active, non-antigen-specific, process which requires continuous exposure of the T 
cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy 
in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and 
persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence 

of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell 
functior should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 
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transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, 
followed by an immune reaction that destroys the transplant. The administration of a therapeutic 
composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, 
and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be sufficient 
5 to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance 
by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 
of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it 
may also be necessary to block the function of a combination of B lymphocyte antigens. 
The efficacy of particular therapeutic compositions in preventing organ transplant 

1 0 rejection or GVHD can be assessed using animal models that are predictive of efficacy in 

humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in 
rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine 
the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et 
aL. Science 257:789-792 (1992) and Turka et al. 5 Proc. Natl. Acad. Sci USA, 89:1 1 102-1 1 105 

15 (1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven 
Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic 
compositions of the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are 

20 reactive against self tissue and which promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may 
reduce or eliminate disease symptoms. Administration of reagents which block stimulation of T 
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T 
cell-derived cytokines which may be involved in the disease process. Additionally, blocking 

25 reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to 

long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of 
human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, 
systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune 

30 collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental 

myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 
840-856). 

Upregulation of an antigen function (e.g., a B tymphocyte antigen function), as a means 
of up regulating immune responses, may also be useful in therapy. Upregulation of immune 
35 responses may be in the form of enhancing an existing immune response or eliciting an initial 
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immune response. For example, enhancing an immune response may be useful in cases of viral 
infection, including systemic viral diseases such as influenza, the common cold, and 
encephalitis. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient by 
5 removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form of 
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 
patient. Another method of enhancing anti-viral immune responses would be to isolate infected 
cells from a patient, transfect them with a nucleic acid encoding a protein of the present 
1 0 invention as described herein such that the cells express all or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient. The infected cells would now be 
capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal to T 
cells to induce a T cell mediated immune response against the transfected tumor cells. In 
1 5 addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 

reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with 
nucleic acid encoding all or a portion of {e.g. , a cytoplasmic-domain truncated portion) of an 
MHC class I alpha chain protein and p 2 microglobulin protein or an MHC class 11 alpha chain 
protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II 
20 proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction 
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a 1 
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding 
an antisense construct which blocks expression of an MHC class II associated protein, such as 
the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity 
25 of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce 

tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human 
subject may be sufficient to overcome tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured by the 
following methods: 

30 Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 

those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscicncc (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 

35 78:2488-2412, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
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Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al., J. 
Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al., 
Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching (which 
5 will identify, among others, proteins that modulate T-cell dependent antibody responses and that 
affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. 
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 
Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

10 Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins 

that generate predominantly Thl and CTL responses) include, without limitation, those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 

15 Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
1988; Bertagnolli et aL, J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed 
by dendritic cells that activate naive T-cells) include, without limitation, those described in: 
Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine 

20 173:549-559, 1991; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et 
al., Journal of Experimental Medicine 1 82:255-260, 1995; Nair et al., Journal of Virology 
67:4062-4069, 1993; Huang et ah, Science 264:961-965, 1994; Macatonia et al., Journal of 
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 
94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:631-640, 1990. 

25 Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 

that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 
13:795-808, 1992; Gorczyca et ah, Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 
53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology 

30 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International 
Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and development 
include, without limitation, those described in: Antica et al., Blood 84:1 1 1-117, 1994; Fine et al., 
Cellular Immunology 155:111-122, 1994; Galy et aL, Blood 85:2770-2778, 1995; Toki etal., 

35 Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 
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4.10.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 

5 characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 

stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the 
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, 
alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive 
based on the ability of inhibins to decrease fertility in female mammals and decrease 

10 spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can 
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a 
homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as 
a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH 
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A 

1 5 polypeptide of the invention may also be useful for advancement of the onset of fertility in 

sexually immature mammals, so as to increase the lifetime reproductive performance of domestic 
animals such as, but not limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods. 

20 Assays for activin/inhibin activity include, without limitation, those described in: Vale et 

aL, Endocrinology 91 :562-572, 1972; Ling et al., Nature 321 :779-7S2, 1986; Vale et al., Nature 
321:776-779, 1986; Mason et al, Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. 
USA 83:3091-3095, 1986. 

25 4.10.9 CHEMOTACTIC/CIIEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic 
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, 
T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the 
invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic 

30 receptor activation can be used to mobilize or attract a desired cell population to a desired site of 
action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or 
modulators of the invention) provide particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized infections. For example, attraction of 
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved 

35 immune response s against the tumor or infecting agent. 
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A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 
Whether a particular protein has chemotactic activity for a population of cells can be readily 
5 determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
10 population to another cell population. Suitable assays for movement and adhesion include, 

without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 
M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates 
and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 
6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 
15 1995: Mullcr et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 
1994; Johnston et al. J. of Immunol. 153:1762-1768, 1994. 

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
20 thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 

attributes. Compositions may be useful in treatment of various coagulation disorders (including 
hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 
in treating wounds resulting from trauma, surgery or other causes. A composition of the 
invention may also be useful for dissolving or inhibiting formation of thromboses and for 
25 treatment and prevention of conditions resulting therefrom (such as, for example, infarction of 
cardiac and central nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 
Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 
30 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474, 1988. 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
35 metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
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invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 
5 condition. Identification of single nucleotide polymorphisms associated with cancer or a 
predisposition to cancer may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
10 compositions of the invention may be effective in adult and pediatric oncology including in solid 
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 
cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell 
15 cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 

cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including 
bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, 
20 kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 
tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 
hemangiopericytoma and Karposi's sarcoma. 
25 Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 

inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 
effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 
chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
30 effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as a 
portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically 
35 acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. 
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Anti -cancer drugs that are well known in the art and can be used as a treatment in combination 
with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, 
Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis- 
DDP), Cyclophosphamide, Cytarabinc HC1 (Cytosinc arabinoside), Dacarbazine, Dactinomycin, 
5 Daunorubicin HC1, Doxorubicin HC1, Estramustine phosphate sodium, Etoposide (V 1 6-21 3), 
Floxuridine, 5-Fluorouracil (5-Fu) 3 Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, 
Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 
Lomustine, Mechlorethamine HC1 (nitrogen" mustard), Melphalan, Mercaptopurine, Mesna, 
Methotrexate (MTX), Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, 

10 Streptozocin, Tamoxifen citrate. Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2 3 Mitoguazone, Pentostatin, 
Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 

15 exposure to carcinogens) known in the art that predispose an individual to developing cancers. 
Under these circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays of 

20 cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of 
Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 1 8 and Ch 21), 
tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30 
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 
Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 

25 of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial 
cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1 189-97 (1999) and Li et al., 
Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, 
e.g. from American Type Tissue Culture Collection catalogs. 

30 4.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors 
and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and 
35 their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions 
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and their ligands (including without limitation, cellular adhesion molecules (such as selecting 
integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and ligands 
are also useful for screening of potential peptide or small molecule inhibitors of the relevant 
5 receptor/ligand interaction. A protein of the present invention (including, without limitation, 
fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand 
interactions. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods: 

1 0 Suitable assays for receptor-ligand activity include without limitation those described in: 

Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kxuisbeek, D. H. Margulies, E. M. 
Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, 
Measurement of Cellular Adhesion under static conditions 7.28.1- 7.28.22), Takai et al., Proc. 
Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1 145-1 156, 1988; 
15 Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 
175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 
ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 
through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 
20 overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a 
partial antagonist require the use of other proteins as competing ligands. The polypeptides of the 
present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 
colorimetric molecules or a toxin molecules by conventional methods. ("Guide to Protein 
25 Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic 
Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and 
carbon-14 . Examples of colorimetric molecules include, but are not limited to, fluorescent 
molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of 
toxins include, but are not limited, to ricin. 

30 

4.10.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the 
novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a 
35 solid support, borne on a ce^J surface or located intracellular^. One method of drug screening 
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utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the polypeptide or a fragment thereof. Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
be used for standard binding assays. One may measure, for example, the formation of 
5 complexes between polypeptides of the invention or fragments and the agent being tested or 
examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, which are well known in the art. 

Sources for test compounds that may be screened for ability to bind to or modulate (i.e., 
increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 

10 organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

15 The sources of natural product libraries are microorganisms (including bacteria and 

fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 
screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 

20 review, see Science 282:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, 
PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and 
oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, 

25 peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. 
For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. 
Biotechnol 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 
Al-Obeidi et al., Mol Biotechnol 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 
1(1):114-19 (1997); Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

30 Identification of modulators through use of the various libraries described herein permits 

modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a 
polypeptide of the invention. The molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 
art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested 

35 for either cell/animal death or prolonged survival of the animal/cells. 
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The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide e.g. a 
ligand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 
to identify polynucleotides encoding binding partners. As another example, affinity 
chromatography with the appropriate immobilized polypeptide of the invention can be used to 
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number 
of different libraries used for the identification of compounds, and in particular small molecules, 
that modulate (i.e., increase or decrease) biological activity of a polypeptide of the invention. 
Ligands for receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands to two cells populations that are genetically identical except for 
the expression of the receptor of the invention: one cell population expresses the receptor of the 
invention whereas the odier does not. The response of the two cell populations to the addition of 
ligands(s) are then compared. Alternatively, an expression library can be co-expressed with the 
polypeptide of the invention in cells and assayed for an autocrine response to identify potential 
ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known 
in the art can be used to identify binding partner polypeptides, including, (1) organic and 
inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of the 
polypeptide of the invention can be determined. For example, a chimeric protein in which the 
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 
protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated 
with the ligand specific for the extracellular portion of the chimeric protein, thereby activating 
the chimeric receptor. Known downstream proteins involved in intracellular signaling can then 
be assayed for expected modifications i.e. phosphorylation. Other methods known to those in the 
art can also be used to identify signaling molecules involved in receptor activity. 
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4.10.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the 
inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
5 cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 
with such activities can be used to treat inflammatory conditions including chronic or acute 
conditions), including without limitation intimation associated with infection (such as septic 

1 0 shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from 
over production of cytokines such as TNF or IL-1. Compositions of the invention may also be 
useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

15 Compositions of this invention may be utilized to prevent or treat conditions such as, but not 
limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 
arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, 
graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary 
disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 

20 acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 



4.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a 
25 therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblasts, promyelocyte, 
myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 
(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
30 Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 

4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
35 polypeptides of the invention, and which can be treated upon thus observing an indication of 
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therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
human and non-human mammalian patients) according to the invention include but are not 
5 limited to the following lesions of either the central (including spinal cord, brain) or peripheral 
nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 
surgery, for example, lesions which sever a portion of the nervous system, or compression 
injuries; 

10 (ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 

results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 
infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by human 

15 immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 
tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 
with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 

20 sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, 
tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus 

25 callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not limited to 
diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or 
sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 

30 neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
injured by a demyelinating disease including but not limited to multiple sclerosis, human 
immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 
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Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
any of the following effects may be useful according to the invention: 
5 (i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., 
choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

1 0 Such effects may be measured by any method known in the art. In preferred, 

non-limiting embodiments, increased survival of neurons may be measured by the method set 
forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 
be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. 
(1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 

15 be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., 

depending on the molecule to be measured; and motor neuron dysfunction may be measured by 
assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron 
conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 

20 invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 
muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 

25 muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 
poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy 
(Charcot-Marie-Tooth Disease). 

4.10.18 OTHER ACTIVITIES 

30 A polypeptide of the invention may also exhibit one or more of the following additional 

activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, 
including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing 
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape 

35 (such as, for example, breast augmentation or diminution, change in bone form or shape); 
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effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female 
subjects: effecting the metabolism, catabolism, anabolism, processing, utilization, storage or 
elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 
nutritional factors or component(s); effecting behavioral characteristics, including, without 
5 limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 

(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 
than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
10 hyperproliferative disorders (such as, for example, psoriasis); nnmunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen 
in a vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

15 4. 1 0.19 IDENTIFICATION OF POLYMORPHISMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 
and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 
20 response) or a differential response to drug administration, and this genetic information can be 
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying the presence of the 
polymorphism. 

25 Polymorphisms can be identified in a variety of ways known in the art which all 

generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally 
involving isolation or amplification of the DNA, and identifying the presence of the 
polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment 
of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to 

30 allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are 

Irybridized to the DNA under conditions permitting detection of a single base mismatch) or to a 
single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately 
adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fragment length polymorphism analysis (using restriction 

35 enzymes that provide differential digestion of the genomic DNA depending on the pi isence or 
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absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 
nucleotide sequences of the present invention in order to detect the nucleotide sequences of the 
present invention. In the alternative, any one of the nucleotide sequences of the present 
5 invention can be placed on the array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., 
by an antibody specific to the variant sequence. 

1 0 4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 
arthritis is determined in an experimental animal model system. The experimental model system 
is adjuvant induced arthritis in rats 5 and the protocol is described by J. Holoshitz, et at., 1983, 
Science, 219:56, or by B. Waksman et al. 3 1963, Int. Arch. Allergy Appl. Immunol., 23:129. 

15 Induction of the disease can be caused by a single injection, generally intradermally, of a 
. suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The 
route of injection can vary, but rats may be injected at the base of the tail with an adjuvant 
mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 
1-5 mg/kg. The control consists of administering PBS only. 

20 The procedure for testing the effects of the test compound would consist of intradermally 

injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 
24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 
described by J. Holoskitz above. An analysis of the data would reveal that the test compound 

25 would have a dramatic affect on the swelling of the joints as measured by a decrease of the 
arthritis score. 

4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or 
30 other binding partners or modulators including antisense polynucleotides) of the invention have 
numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 
include, but are not limited to, those exemplified herein. 

4.11.1 EXAMPLE 
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One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode of 
administration is not particularly important, parenteral administration is preferred. An 

5 exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, weight, 
condition and response of the individual patient. Typically, the amount of polypeptide 
administered per dose will be in the range of about O.Olfig/kg to 100 mg/kg of body weight, with 
10 the preferred dose being about 0. 1 jug/kg to 1 0 mg/kg of patient body weight. For parenteral 

administration, polypeptides of the invention will be formulated in an injectable form combined 
with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art 
and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting 
of small amounts of the human serum albumin. The vehicle may contain minor amounts of 

1 5 additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. 
The preparation of such solutions is within the skill of the art. 

4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

20 A protein or other composition of die present invention (from whatever source derived, 

including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 
carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 

25 may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 

fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredients). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of the 

30 invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 

M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-1 1, IL-12, 
IL-1 3, IL-14, IL-1 5, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agtnts beneficial to the treatment of the disease or disorder in question. These agents 

35 include various growth factors such as epidermal growth factor (EGF), platelet-derived growth 
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factor (PDGF), transforming growth factors (TGF-a and TGF-p), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either enhance 
the activity of the protein or other active ingredient or complement its activity or use in 
5 treatment. Such additional factors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein or other active ingredient of the 
invention, or to minimize side effects. Conversely, protein or other active ingredient of the 
present invention may be included in formulations of the particular clotting factor, cytokine, 
lymphokine, other hematopoietic factor, thrombolytic or antithrombotic factor, or anti- 

10 inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or anti-thronibotic factor, or anti-inflammatory agent (such as 
IL-IRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein 
of the present invention may be active in multimers (e.g., heterodimers or homodimers) or 
complexes with itself or other proteins. As a result, pharmaceutical compositions of the 

15 invention may comprise a protein of the invention in such multimeric or complexed form. 

As an alternative to being included in a pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 

20 Techniques for formulation and administration of the compounds of the instant application may 
be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition. A therapeutically effective dose further refers to that amount of the compound sufficient 
to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the 
relevant medical condition, or an increase in rate of treatment, healing, prevention or 

25 amelioration of such conditions. When applied to an individual active ingredient, administered 
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a 
combination, a therapeutically effective dose refers to combined amounts of the active 
ingredients that result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 

30 In practicing the method of treatment or use of the present invention, a therapeutically 

effective amount of protein or other active ingredient of the present invention is administered to 
a mammal having a condition to be treated. Protein or other active ingredient of the present 
invention may be administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines or other 

35 hematopoietic factors. When co- administered with one or more cytokines, lymphokines or other 
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hematopoietic factors, protein or other active ingredient of the present invention may be 
administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 
factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially, 
the attending physician will decide on* Hie appropriate sequence of administering protein or other 
5 active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors. 

4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, or 
1 0 intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 
intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active 
ingredient of the present invention used in the pharmaceutical composition or to practice the 
method of the present invention can be carried out in a variety of conventional ways, such as oral 
1 5 ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral 
or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often 
in a depot or sustained release formulation. In order to prevent the scarring process frequently 
20 occurring as complication of glaucoma surgery, the compounds may be administered topically, 
for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the 
afflicted tissue. 

25 The polypeptides of the invention are administered by any route that delivers an effective 

dosage to the desired site of action. The determination of a suitable route of administration and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage 
ranges for the polypeptides of the invention can be extrapolated from these dosages or from 

30 similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit. 

4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus may 
35 be formulated in a conventional manner using one or more physiologically acceptable carrier'- 
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comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutical^. These pharmaceutical compositions may be 
manufactured in a manner that is itself known, e.g., by means of conventional mixing, 
dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
5 Iyophilizing processes. Proper formulation is dependent upon the route of administration 
chosen. When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered orally, protein or other active ingredient of the present 
invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered 
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid 

10 carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 
95% protein or other active ingredient of the present invention, and preferably from about 25 to 
90% protein or other active ingredient of the present invention. When administered in liquid 
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, 
mineral oiL soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 

15 pharmaceutical composition may further contain physiological saline solution, dextrose or other 
saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. 
When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 
90% by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

20 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of the present invention will be in the form of a pyro'gen-free, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 
active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within 

25 the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active ingredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 
Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 
other vehicle as known in the art. The pharmaceutical composition of the present invention may 

30 also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of 
skill in the art. For injection, the agents of the invention may be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 
solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate 
to the barrier to be permeated are used in the formulation. Such penetrants are generally known 

35 in the art. 
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For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutical^ acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
5 treated. Pharmaceutical preparations for oral use can be obtained from a solid excipient, 
optionally grinding a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in 
particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, 
10 gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 

carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents 
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this 
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, 
1 5 talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be 
added to the tablets or dragee coatings for identification or to characterize different combinations 

of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
20 gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 

sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 
suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
25 stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
30 pressurized packs or a nebuliser, with the use of a suitable propellant, e.g. , 

diclilorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 
other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use 
in an inhaler or insufflator may be formulated containing a powder mix of the compound and a 
35 suitable powder base such as lactose or starch. The compounds may be formulated for parenteral 
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administration by injection, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g. , in ampules or in multi-dose containers, with 
an added preservative. The compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, 
5 stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Additionally, suspensions of the active compounds 
may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 
vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 

10 triglycerides, or liposomes. Aqueous injection suspensions may contain substances which 

increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Optionally, the suspension may also contain suitable stabilizers or agents which 
increase the solubility of the compounds to allow for the preparation of highly concentrated 
solutions. Alternatively, the active ingredient may be in powder form for constitution with a 

15 suitable vehicle, e.g.„ sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 

20 implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent 

25 system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and 
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution 
of 3?^ w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v 
polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system 
(VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent 

30 system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 
administration. Naturally, the proportions of a co-solvent system may be varied considerably 
without destroying its solubility and toxicity characteristics. Furthermore, the identity of the 
co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may 
be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other 

35 biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other 
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sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for 
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well 
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents 
such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
5 Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
Various types of sustained-release materials have been established and are well known by those 
skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
10 biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 
or excipicnts. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
15 polymers such as polyethylene glycols. Many of the active ingredients of the invention may be 
provided as salts with pharmaceutical^ compatible counter ions. Such pharmaceutical^ 
acceptable base addition salts are those salts which retain the biological effectiveness and 
properties of the free acids and which are obtained by reaction with inorganic or organic bases 
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, 
20 monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and 
the like. 

The pharmaceutical composition of the invention may be in the form of a complex of the 
protein(s) or other active ingredients) of present invention along with protein or peptide 
antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
25 lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins including 
those encoded by class 1 and class II MHC genes on host cells will serve to present the peptide 
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified 
30 MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. 
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as 
well as antibodies able to bind the TCR and other molecules on T cells can be combined with the 
pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a liposome in 
35 which protein of the present invention is combined, in addition to other pharmaceutically 
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acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable 
lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, 
sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such 
liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. 
Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated 
herein by reference. 

The amount of protein or other active ingredient of the present invention in the 
pharmaceutical composition of the present invention will depend upon the nature and severity of 
the condition being treated, and on the nature of prior treatments which the patient has 
undergone. Ultimately, the attending physician will decide the amount of protein or other active 
ingredient of the present invention with which to treat each individual patient. Initially, the 
attending physician will administer low doses of protein or other active ingredient of the present 
invention and observe the patient's response. Larger doses of protein or other active ingredient 
of the present invention may be administered until the optimal therapeutic effect is obtained for 
the patient, and at that point the dosage is not increased further. Tt is contemplated that the 
various pharmaceutical compositions used to practice the method of the present invention should 
contain about 0.01 |ag to about 100 mg (preferably about 0.1 |xg to about 10 mg, more preferably 
about 0. 1 ng to about 1 mg) of protein or other active ingredient of the present invention per kg 
body weight. For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the therapeutic 
composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable 
form. Further, the composition may desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable 
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other 
active ingredient of the invention which may also optionally be included in the composition as 
described above, may alternatively or additionally, be administered simultaneously or 
sequentially with the composition in the methods of the invention. Preferably for bone and/or 
cartilage formation, the composition would include a matrix capable of delivering the 
protein-containing or other active ingredient-containing composition to the site of bone and/or 
cartilage damage, providing a structure for the developing bone and cartilage and optimally 
capable of being resorbed into the body. Such matrices may be fonned of materials presently in 
use for other implanted medical applications. 
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The choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties, cosmetic appearance and interface properties. The particular application of the 
compositions will define the appropriate formulation. Potential matrices for the compositions 
may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 

5 hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials 
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 
matrices are comprised of pure proteins or extracellular matrix components. Other potential 
matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above 
1 0 mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalcium phosphate. The bioceramics may be altered in composition, such as in 
calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 
glycolic acid in the form of porous particles having diameters ranging from 150 to 800 microns. 

1 5 In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl 

cellulose or autologous blood clot, to prevent the protein compositions from disassociating from 
the matrix. 

A preferred family of sequestering agents is cellulosic materials such as alkylceltuloses 
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, 

20 hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and 

carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, 
polyethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and polyvinyl alcohol). 
The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on 

25 total formulation weight, which represents the amount necessary to prevent desorption of the 

protein from the polymer matrix and to provide appropriate handling of the composition, yet not 
so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the 
protein the opportunity to assist the osteogenic activity of the progenitor cells. In further 
compositions, proteins or other active ingredients of the invention may be combined with other 

30 agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 

question. These agents include various growth factors such as epidermal growth factor (EGF), 
platelet derived growth factor (PDGF), transforming growth factors (TGF-ct and TGF-0), and 

insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 
35 Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
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patients for such treatment with proteins or other active ingredients of the present invention. The 
dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 
regeneration will be determined by the attending physician considering various factors which 
modify the action of the proteins, e.g., amount of tissue weight desired to be fonned, the site of 
5 damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., 
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution 
and with inclusion of other proteins in the pharmaceutical composition. For example, the 
addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final 

10 composition, may also effect the dosage. Progress can be monitored by periodic assessment of 
tissue/bone growth and/or repair, for example. X-rays, histomorphometric determinations and 
tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 

15 mammalian subject. Polynucleotides of the invention may also be administered by other known 
methods for introduction of nucleic acid into a cell or organism (including, without limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

20 

4.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve its 
intended purpose. More specifically, a therapeutically effective amount means an amount 

25 effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Determination of the effective amount is well within the capability of those skilled in 
the art, especially in light of the detailed disclosure provided herein. For any compound used in 
the method of the invention, the therapeutically effective dose can be estimated initially from 
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 

30 circulating concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a circulating 
concentration range that includes the lC 5 o as determined in cell culture (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of the protein's biological activity). 
Such information can be used to more accurately determine useful doses in humans. 
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A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., for determining the LD 50 (the dose lethal to 50% of the 
population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose 
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio between LD50 and ED50. Compounds which exhibit high therapeutic indices are preferred. 
The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED 50 with little or no toxicity. The dosage may 
vary within this range depending upon the dosage form employed and the route of administration 
utilized. The exact formulation, route of administration and dosage can be chosen by the 
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amount and interval may be adjusted 
individually to provide plasma levels of the active moiety which are sufficient to maintain the 
desired effects, or minimal effective concentration (MEC). The MEC will vary for each 
compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should be 
administered using a regimen which maintains plasma levels above the MEC for 10-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 

related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the invention 
will be in the range of about 0.01 ug/kg to 100 mg/kg of body weight daily, with the preferred 
dose being about 0.1 ug/kg to 25 mg/kg of patient body weight daily, varying in adul* and 
children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 
intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

4.12.4 PACKAGING 
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The compositions may, if desired, be presented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The pack may. for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
be accompanied by instructions for administration. Compositions comprising a compound of the 
5 invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition. 

4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the 

10 invention. The term "antibody" as used herein refers to immunoglobulin molecules and 

immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain 
an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F ab; F ab - and F( a b')2 
fragments, and an F a b expression library. In general, an antibody molecule obtained from 

15 humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another 
by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, 
such as IgGi, IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to all such classes, 
subclasses and types of human antibody species. 

20 An isolated related protein of the invention may be intended to serve as an antigen, or a 

portion or fragment thereof, and additionally can be used as an imrnunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 
invention provides antigenic peptide fragments of the antigen for use as immunogens. An 

25 antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence 
of the full length protein, (for example the amino acid sequence shown in SEQ ID NO: 1351), 
and encompasses an epitope thereof such that an antibody raised against the peptide forms a 
specific immune complex with the full length protein or with any fragment that contains the 
epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 

30 15 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 
located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of -related protein that is located on the surface of the protein, e.g., a 

35 hydrophilic region. A hydrophobicity analysis of th s human related protein sequence will 
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indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely 
to encode surface residues useful for targeting antibody production. As a means for targeting 
antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobic^ 
may be generated by any method well known in the art, including, for example, the Kyte 
5 Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g. , 
Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. 
Mol. Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 
Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, 
fragments, analogs or homologs thereof, are also provided herein. 
10 A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 

thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal or 
monoclonal antibodies directed against a protein of the invention, or against derivatives, 
1 5 fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 



5.13.1 Polyclonal Antibodies 

20 For the production of polyclonal antibodies, various suitable host animals {e.g. , rabbit, 

goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
25 recombinant^ expressed immunogenic protein. Furthermore, the protein may be conjugated to 
a second protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 
bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further include an 
adjuvant. Various adjuvants used to increase the immunological response include, but are not 
30 limited to, Freund's (complete and incomplete), mineral gels {e.g. , aluminum hydroxide), surface 
active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and 
Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of 
adjuvants which can be employed include MPL-TDM adjuvant (monopbosphoryl Lipid A, 
3 5 synthetic trehalose dicorynomycolate). 
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The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal {e.g., from the blood) and further purified by well known techniques, 
such as affinity chromatography using protein A or protein G, which provide primarily the IgG 
fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the 
5 target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to 
purify the immune specific antibody by immunoaffinity chromatography. Purification of 
immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 
Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

10 5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 
herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of a unique light chain gene product and a unique heavy chain 
gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal 

15 antibody are identical in all the molecules of the population. MAbs thus contain an antigen 

binding site capable of hnmunoreacting with a particular epitope of the antigen characterized by 
a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature. 256:495 (1 975). In a hybridoma method, a mouse, 

20 hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or a 
fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 

25 origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal 
Antibodies: Principles and Practice . Academic Press, (1986) pp. 59-103). Immortalized cell 
lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 

30 human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can 
be cultured in a suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of the unfiised, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 

35 thymidine ("HAT medium"), which substances prevent thn growth of HGPRT-deficient cells. 
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Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 
can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 
5 California and the American Type Culture Collection Manassas, Virginia. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol. . 133:3001 (1984); Brodeur et al., Monoclonal 
Antibody Production Techniques and Applications , Marcel Dekker, Inc., New York, (1987) pp. 
51-63). 

10 The culture medium in which the hybridoma cells are cultured can then be assayed for 

the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the 

15 art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem. , 107:220 (1 980). Preferably, 
antibodies having a high degree of specificity and a high binding affinity for the target antigen 
are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 
20 dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 
culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
25 as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
30 oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
3 5 monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 
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example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 
812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
5 polypeptide can be substituted for the constant domains of an antibody of the invention, or can 
be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. 

5.13.2 Humanized Antibodies 

10 The antibodies directed against the protein antigens of the invention can further comprise 

humanized antibodies or human antibodies. These antibodies are suitable for administration to 
humans without engendering an immune response by the human against the administered 
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen- 

15 binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature , 321 :522-525 (1 986); Riechmann et al., Nature , 332:323-327 (1988); Verhoeyen et al., 
Science , 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 

20 corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable 

25 domains, in which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol. . 

30 2:593-596 (1992)). 

5.13.3 Human Antibodies 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
35 genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
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Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 
5 antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by 
transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1 985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
10 including phage display libraries (Hoogenboom and Winter, J. Mol. Biol. , 227:381 (1991); 
Marks et al., J. Mol. Biol. , 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans 
15 in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 
is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992)); Lonberg et al. 
(Nature 368 856-859 (1994)); Morrison (Nature 368, 812-13 (1994)); Fishwild et al,( Nature 
Biotechnology 14, 845-5 1 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and 
20 Lonberg and Huszar ^Intern. Rev. Immunol. 13 65-93 (1 995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The 
endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
25 have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 
provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
30 preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse 1 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 
cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from 
the animal after immunization with an immunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as 
35 hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
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immunoglobulins with human variable regions can be recovered and expressed to obtain the 
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 
example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 
5 expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 

5,939,598. It can be obtained by a method including deleting the J segment genes from at least 
one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain locus, 
the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 

10 and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells 
contain the gene encoding the selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is disclosed in 
U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a 
nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing 

15 an expression vector containing a nucleotide sequence encoding a light chain into another 

mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 
antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically relevant 
epitope on an immunogen, and a correlative method for selecting an antibody that binds 

20 immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication 
WO 99/53049. 

5.13.4 F ab Fragments and Single Chain Antibodies 

According to the invention, techniques can be adapted for the production of single-chain 
25 antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of F ab expression libraries (see e.g., 
Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of 
monoclonal F a b fragments with the desired specificity for a protein or derivatives, fragments, 
analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen 
30 may be produced by techniques known in the art including, but not limited to: (i) an F (ab ') 2 

fragment produced by pepsin digestion of an antibody molecule; (ii) an F ab fragment generated 
by reducing the disulfide bridges of an F( ab ' )2 fragment; (iii) an F ab fragment generated by the 
treatment of the antibody molecule with papain and a reducing agent and (iv) F v fragments. 

35 5.13.5 Bispecific Antibodies 
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Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 
other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 
5 Methods for making bispecific antibodies are known in the art. Traditionally, the 

recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
10 potential mixture often different antibody molecules, of which only one has the correct 

bispecific structure. The purification of the correct molecule is usually accomplished by affinity 
cliromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 
1993, and in Traunecker et al, 1991 EMBO J., 10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
1 5 combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 

preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 
20 light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
a ] Methods in Enzvmologv , 121:210 (1986). 

~ According to another approach described in WO 96/2701 1, the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which are 
25 recovered from recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. 
tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino 
30 acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 
F(ab') 2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in the literature. For example, bispecific antibodies can be 
35 prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure 
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wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab 5 fragments 
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
5 derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab 5 -TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 

10 coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' fragment 
was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 

15 of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly from 
recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucine zippers. Kostelny et al., J. Immunol. 148(5): 1547-1 553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 

20 different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispecific antibody fragments. The fragments comprise a 

25 heavy-chain variable domain (Vh) connected to a light-chain variable domain (Vl) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the Vh and Vl domains of one fragment are forced to pair with the complementary Vl and Vh 
domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 

30 reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al. a J. Immunol. 147:60 (1991). 
Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an 

35 immunoglobulin molecule can be combined with an arm which binds to a'triggering molecule on 
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a leukocyte sueh as a T-cell reeeptor molecule (e.g. CD2, CDS, CD2S, or B7), or Fc receptors for 
IgG(Fc R),suchasFc RI(CD64),Fc RII (CD32) andFc RIII (CD 16) so as to focus 
cellular defense mechanisms to the ceil expressing the particular antigen. Bispecific antibodies 
can also be used to direct cytotoxic agents to cells which express a particular antigen. These 
5 antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a 

radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody 
of interest binds the protein antigen described herein and further binds tissue factor (TF). 

5.13.6 Heteroconjugate Antibodies 

1 0 Heteroconjugate antibodies are also within the scope of the present invention. 

Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 
have for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HTV infection (WO 91/00360; WO 92/200373- EP 03089). 
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic 
15 protein chemistry, including those involving crosslinking agents. For example, immunotoxins 
can be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 

20 5.13.7 Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector function, so as 
to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine 
residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond 
formation in this region. The homodimeric antibody thus generated can have improved 
25 internalization capability and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med., 176: 1191-1195 (1992) 
and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti- 
tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff 
et al. Cancer Research, 53 : 2560-2565 (1993). Alternatively, an antibody can be engineered that 
30 has dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 



5.13.8 Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
35 cytotoxic agent such as a cher therapeutic agent, toxin (e.g. , an enzymatically active toxin of 
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bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 
described above. Enzymatically active toxins and fragments thereof that can be used include 
5 diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPIL and 
PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogcllin. restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 

.10 radionuclides are available for the production of radioconjugated antibodies. Examples include 
212 Bi 3 ,31 I, 13, In 3 90 Y,and ,86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional 
protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithioI) propionate (SPDP), 
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), 

1 5 active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 
compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as 
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2 5 6-diisocyanate), 
and bis-active fluorine compounds (such as l,5-difluoro-2,4-dinitro benzene). For example, a 
ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). 

20 Carbon- 14-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- 
DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
25 administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 

4.14 COMPUTER READABLE SEQUENCES 

30 In one application of this embodiment, a nucleotide sequence of the present invention can 

be recorded on computer readable media. As used herein, "computer readable media" refers to 
any medium which can be read and accessed directly by a computer. Such media include, but 
are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 

35 and ROM; and hybrids of these categories such as magnetic/optical storage nedia. A skilled 
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artisan can readily appreciate how any of the presently known computer readable mediums can 
be used to create a manufacture comprising computer readable medium having recorded thereon 
a nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for 
storing information on computer readable medium. A skilled artisan can readily adopt any of the 
presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs and formats 
can be used to store the nucleotide sequence information of the present invention on computer 
readable medium. The sequence information can be represented in a word processing text file, 
formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 
formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 

By providing any of the nucleotide sequences SEQ ID NO:1-1350 or a representative 
fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 
sequences of SEQ ID NO: 1-1 3 50 in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 
readable medium. The examples which follow demonstrate how software which implements the 
BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. 
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading 
frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments 
and may be useful in producing commercially important proteins such as enzymes used in 
fermentation reactions and in the production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the present 
invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently available 
computer-based systems are suitable for use in the present invention. As stated above, the 
computer-based systems of the prese nt invention comprise a data storage means having stored 
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therein a nucleotide sequence of the present invention and the necessary hardware means and 
software means for supporting and implementing a search means. As used herein, "data storage 
means" refers to memory which can store nucleotide sequence information of the present 
invention, or a memory access means which can access manufactures having recorded thereon 
5 the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence which match a particular target sequence or target 

1 0 motif. A variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the computer-based 
systems of the present invention. Examples of such software includes, but is not limited to, 
Smith- Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A 
skilled artisan can readily recognize that any one of the available algorithms or implementing 

1 5 software packages for conducting homolog}' searches can be adapted for use in the present 

computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will be 
present as a random occurrence in the database. The most preferred sequence length of a target 

20 sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fragments involved in gene expression and protein processing, may be of 
shorter length. 

As used herein, "a target structural motif," or "target motif," refers to any rationally 
25 selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif. There are 
a variety of target motifs known in the art. Protein target motifs include, but are not limited to, 
enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding 
30 sequences). 



4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to 
control gene expression through triple helix formation or antisense DNA or RNA, both of which 
35 methods are based on the binding of a polynucleotide sequence to DNA or RNA. 
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Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 15241:456 (1988); and Dervan 
et al., Science 251 :1 360 (1991)) or to the mRNA itself (antisense - Olmno, J. Neurochem. 

5 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RN A 
transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA 
molecule into polypeptide. Both techniques have been demonstrated to be effective in model 
systems. Information contained in the sequences of the present invention is necessary for the 

1 0 design of an antisense or triple helix oligonucleotide. 

4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
1 5 acid probe or antibodies of the present invention, optionally conjugated or otherwise associated 

with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 
20 detected, a polynucleotide of the invention is detected in the sample. Such methods can also 

comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 
that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 
detected in the sample. 

25 In general, methods for detecting a polypeptide of the invention can comprise contacting 

a sample with a compound that binds to and forms a complex with the polypeptide for a period 
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 
polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
30 antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample. 

Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 
Incubation conditions depend on the format employed in the assay, the detection melhods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One 
3 5 skilled in the art will recognize that any one of the commonly available hybridization, 
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amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Tmmunocytochemistry, 
5 Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice 
and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the 
present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method 

10 will vary based on the assay format, nature of the detection method and the tissues, cells or 

extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 
extracts of cells are well known in the art and can be readily be adapted in order to obtain a 
sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 

15 necessary reagents to carry out the assays of the present invention. Specifically, the invention 
provides a compartment kit to receive, in close confinement, one or more containers which 
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of a bound probe or antibody. 

20 In detail, a compartment kit includes any kit in which reagents are contained in separate 

containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efficiently transfer reagents from one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 
agents or solutions of each container can be added in a quantitative fashion from one 

25 compartment to another. Such containers will include a container which will accept the test 
sample, a container which contains the antibodies used in the assay, containers which contain 
wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which 
contain the reagents used to detect the bound antibody or probe. Types of detection reagents 
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the 

30 primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 

reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 
probes and antibodies of the present invention can be readily incorporated into one of the 
established kit formats which are well known in the art. 

35 4.17 MEDICAL IMAGING 
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The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
See, e.g., Kunkel et al. ? U.S. Pat. NO. 5,413,778. Such methods involve chemical attachment of 
5 a labeling or imaging agent, administration of the labeled polypeptide to a subject in a 

pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target 
site. 

4.18 SCREENING ASSAYS 
10 Using the isolated proteins and polynucleotides of the invention, the present invention 

further provides methods of obtaining and identifying agents which bind to a polypeptide 

encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO:l- 

1350, or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, said 

method comprises the steps of: 
1 5 (a) contacting an agent with an isolated protein encoded by an ORF of the present 

invention, or nucleic acid of the invention; and 

(b) detejmining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

polynucleotide of the invention can comprise contacting a compound with a polynucleotide of 
20 the invention for a time sufficient to form a polynucleotide/compound complex, and detecting 

the complex, so that if a polynucleotide/compound complex is detected, a compound that binds 

to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 

polypeptide of the invention can comprise contacting a compound with a polypeptide of the 
25 invention for a time sufficient to form a polypeptide/compound complex, and detecting the 

complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 

polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention can also 

comprise contacting a compound with a polypeptide of the invention in a cell for a time 
30 sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a 

receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 

sequence expression, so that if a polypeptide/compound complex is detected, a compound that 

binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 
35 activity of a polypeptide of the invention (th< t is, increase or decrease its activity, relative to 
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activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods can include compounds which modulate the expression of a polynucleotide of the 
invention (that is, increase or decrease expression relative to expression levels observed in the 
absence of the compound). Compounds, such as compounds identified via the methods of the 
5 invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed using protein modeling techniques. 

1 0 For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 

the like are selected at random and are assayed for their ability to bind to the protein encoded by 
the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 
based on the configuration of the particular protein. For example, one skilled in the art can 

15 readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 
like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense 
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

20 In addition to the foregoing, one class of agents of the present invention, as broadly 

described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 
sequence specific or element specific agents, modulating the expression of either a single ORF or 

25 multiple ORFs which rely on the same EMF for expression control. One class of DNA binding 
agents are agents which contain base residues which hybridize or form a triple helix formation 
by binding to DNA or RN A. Such agents can be based on the classic phosphodiester, 
ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have 
base attachment capacity. 

30 Agents suitable for use in these methods preferably contain 20 to 40 bases and are 

designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et 
al., Science 251 :1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991): Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
35 Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RN A transcription 
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from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary for the design of an 
antisense or triple helix oligonucleotide and other DNA binding agents. 
5 Agents which bind to a protein encoded by one of the ORFs of the present invention can 

be used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the 
present invention can be formulated using known techniques to generate a pharmaceutical 
composition. 

10 4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 
hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NO:1-1350. Because the corresponding gene is only expressed in a limited 
15 number of tissues, a hy bridization probe derived from of any of the nucleotide sequences SEQ 
ID NO:1-1350 can be used as an indicator of the presence of RNA of cell type of such a tissue in 
a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides 

20 additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in 
PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The 
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a 
degenerate pool of possible sequences for identification of closely related genomic sequences. 
Other means for producing specific hybridization probes for nucleic acids include the 

25 cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors 
are known in the ait and are commercially available and may be used to synthesize RNA probes 
in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 
polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping their respective genomic sequences. The 

30 nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a 
chromosome using well known genetic and/or chromosomal mapping techniques. These 
techniques include in situ hybridization, linkage analysis against known chromosomal markers, 
hybridization screening with libraries or flow-sorted chromosomal preparations specific to 
known chromosomes, and the like. The technique of fluorescent in situ hybridization of 
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clixomosome spreads has been described, among other places, in Verma et al (1988) Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 
chromosome mapping techniques may be correlated with additional genetic map data. Examples 
5 of genetic map data can be found in the 1 994 Genome Issue of Science (265 : 1 98 1 f). Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 
genetic disease. The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected individuals. 

1 0 4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, /.e, small nucleic acid segments, may be readily prepared by, for 
example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced 
using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those of 
1 5 skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to 
precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be 
achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6) 1469-72); 
using UV light (Nagatae/ al, 1985; Dahlen et al, 1987; Morrissey & Collins, (1989) MoL Cell 
Probes 3(2) 1 89-207) or by covalent binding of base modified DNA (Keller et al, 1 988; 1 989): all 
20 references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 
interaction as a linker. For example, Broude et al (1994)Proc. Natl. Acad. Sci. USA 91(8) 3072-6, 
describe the use of biotinylated probes, although these are duplex probes, that are immobilized on 
streptavi din-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, 
25 Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin. 
Biotinylated probes may be purchased from various sources, such as, e.g., Operon Technologies 
(Alameda, CA). t 

Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc 
Laboratories have developed a method by which DNA can be covalently bound to the microwell 
3 0 surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with secondary amino 
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be 
purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 
5'-end by a phosphoramidate bond, allowing immobilization of more than 1 pmol of DNA 
(Rasmussen (1991) Anal. Biochem. 198(1) 138-42). 
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The use of CovaLinkNH strips for covalent binding of DNA molecules at the 5'-end has 
been described (Rasmussenet al., (1991). In this technology, a phosphoramidatebond is employed 
(Chu et al., (1 983) Nucleic Acids Res. 1 1(8) 65 13-29). This is beneficial as immobilization using 
only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the 
CovaLinkNH secondary amino groups that are positioned at the end of spacer arms covalently 
grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to 
CovaLink NH via an phosphoramidatebond, the oligonucleotide terminus must have a 5'-end 
phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and 
then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and 
denaturingfor 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1-methylimidazole, 
pH 7.0 (1-Melm 7 ), is then added to a final concentration of 10 mM 1-Melm 7 . A ss DNA solutionis 
then dispensed into CovaLinkNH strips (75 ul/well) standing on ice. 

Carbodiimide 0.2 M l-emyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in 
10 mM 1-Melm 7 , is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 
50°C. After incubation the strips are washed using, e.g. , Nunc-Immuno Wash; first the wells are 
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 
3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is that 
described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by 
reference. This method of preparing an oligonucleotide bound to a support involves attaching a 
nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported 
nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard 
conditions that do not cleave the oligonucleotide from the support. Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 
arrays may be employed. For example, addressable laser-activated photodeprotection may be 
employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by 
Fodore/ al (1991) Science 251(4995) 767-73, incorporated herein by reference. Probes may also 
be immobilized on nylon supports as described by Van Ness et al. (1991)Nucleic Acids Res. 
19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 
169(1) 104-8; all references being specifically incorporated herein. 
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To link an oligonucleotide to a nylon support, as described by Van Ness et al (1991), 
requires activation of the nylon surface via alkylation and selective activation of the 5 -amine of 
oligonucleotides with cyanuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the 
light-generated synthesis described by Pease et al } (1994) PNAS USA 91(1 1) 5022-6, incorporated 
herein by reference). These authors used current photolithographic techniques to generate arrays of 
immobilized oligonuclcotideprobcs (DNA chips). These methods, in which light is used to direct 
the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile 
5 '-protected A r -acyl-deoxynucleosidephosphoramidites, surface linker chemistry and versatile 
combinatorial synthesis strategics. A matrix of 256 spatially defined oligonucleotide probes may be 
generated in this manner. 

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic 
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, 
including mRN A without any amplification steps. For example, Sambrook et al (1989) describes 
three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 
9.14-9.23). 

DNA fragments may be prepared as clones in M13, plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples 
may be prepared or dispensed in multiwell plates. About 1 00- 1 000 ng of DNA samples may be 
prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of skill 
in the art including, for example, using restriction enzymes as described at 924-9.28 of Sambrook et 
al (1 989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1990) Nucleic 
Acids Res. 1 8(24) 7455-6, incorporated herein by reference). In this method, DNA samples are 
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever 
device allows controlled application of low to intermediate pressures to the cell. The results of 
these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA 
fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
base recognition endonuclease, Cv/JI, described by Fitzgerald et al (1 992) Nucleic Acids Res. 
20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation 
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of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and 
sequencing. 

The restriction endonuclease CviJl normally cleaves the recognition sequence PuGCPy 
between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of 
5 this enzyme (CviJI* *), yield a quasi-random distribution of DNA fragments form the small 
molecule pUC 19 (2688 base pairs). Fitzgerald etal (1992) quantitatively evaluated the 
randomness of this fragmentation strategy, using a CviJI** digest of pUC19 that was size 
fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus 
Ml 3 cloning vector. Sequence analysis of 76 clones showed that Cv/JI* * restricts pyGCPy and 
1 0 PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate 
consistent with random fragmentation. 

As reported in the literature, advantages of this approach compared to sonication and 
agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 
ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel 
1 5 electrophoresis and elution are needed 

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared , it is 
important to denature the DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 
quickly to 2°C to prevent renaturationof the DNA fragments before they are contacted with the 
20 chip. Phosphate groups must also be removed from genomic DNA by methods known in the art. 

4.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using array s of metal pins (the positions of which correspond to an 
array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution to a 

25 nylon membrane. By offset printing, a density of dots higher than the density of the wells is 

achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the type of label used. By 
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) 
may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same 
gene) from different individuals, or may be different, overlapped genomic clones. Each of the 

30 subarrays may represent replica spotting of the same samples. In one example, a selected gene 

segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in 
one 96- well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane. 
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Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the 
dot span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) 
which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid 
5 being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic 
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage 
screens or x-ray films. 

The present invention is illustrated in the following examples. Upon consideration of the 
present disclosure, one of skill in the art will appreciate that many other embodiments and variations 

1 0 may be made in the scope of the present invention. Accordingly, it is intended that the broader 
aspects of the present invention not be limited to the disclosure of the following examples. The 
present invention is not to be limited in scope by the exemplified embodiments which are intended 
as illustrations of single aspects of the invention, and compositions and methods which are 
functionally equivalent are within the scope of the invention. Indeed, numerous modifications and 

1 5 variations in the practice of the invention are expected to occur to those skilled in the art upon 
consideration of the present preferred embodiments. Consequently, the only limitations which 
should be placed upon the scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated by 
reference in their entirety. 

20 5.0 EXAMPLES 

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various 
human tissues and in some cases isolated from a genomic library derived from human chromosome 

25 using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The 
inserts of the library were amplified with PCR using primers specific for the vector sequences 
which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and 
screened with oligonucleotide probes (e.g., 7-mers) to obtain signature sequences. The clones were 
clustered into groups of similar or identical sequences. Representative clones were selected for 

30 sequencing. 

In some cases, the 5* sequence of the amplified inserts was then deduced using a typical 
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye 
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems 
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(ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Random 
Amplification of cDNA Ends) was performed to further extend the sequence in the 5 5 direction. 

5.2 EXAMPLE 2 

5 Novel Contigs 

The novel contigs of the invention were assembled from sequences that were obtained from 
a cDNA library by methods described in Example 1 above, and in some cases sequences obtained 
from one or more public databases. The sequences for the resulting nucleic acid contigs are 
designated as SEQ ID NO: 1-1350 and are provided in the attached Sequence Listing. The contigs 

1 0 were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend 
the seed EST into an extended assemblage, by pulling additional sequences from different databases 
{i.e., Hyseq's database containing EST sequences, dbEST version 1 14, gb pri 1 14, and UniGene 
version 101) that belong to this assemblage. The algorithm terminated when there was no 
additional sequences from the above databases that would extend the assemblage. Inclusion of 

1 5 component sequences into the assemblage was based on a BLASTN hit to the extending 
assemblage with BLAST score greater than 300 and percent identity greater than 95%. 

Table 3 sets forth the novel predicted polypeptides (including proteins) encoded by the 
novel polynucleotides (SEQ ID NO: 1 89-282) of the present invention, and their corresponding 
nucleotide locations to each of SEQ ID NO: 1 89-282. Table 3 also indicates the method by which 

20 the polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 

program called FASTY (available from http://fasta.bioch.virginia>edu) which selects a polypeptide 
based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R. 
Pearson, Methods in Enzymology, 1 83 :63-98 (1 990), herein incorporated by reference). Method B 
refers to a polypeptide obtained by using a software program called GenScan for human/vertebrate 

25 sequences (available from Stanford University, Office of Technology Licensing) that predicts the 
polypeptide based on a probabilistic model of gene structure/compositionalproperties (C. Burge 
and S. Karlin, J. Mol. Biol, 268:78-94 (1997), incorporated herein by reference). Method C refers 
to a polypeptide obtained by using a Hy seq proprietary software program that translates the novel 
polynucleotide and its complementary strand into six possible amino acid sequences (forward and 

30 reverse frames) and chooses the polypeptide with the longest open reading frame. 

The nearest neighbor results for SEQ ID NO: 1-1350 were obtained by a BLASTP 
version 2.0al 19MP-WashU search against Genpept release 120 and Geneseq database October 
12, 2000, update 21 (Derwent), using BLAST algorithm. The nearest neighbor result showed the 
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closest homologue for SEQ ID NO: 1-1350. The nearest neighbor results for SEQ ID NO: 1 - 
1 350 are shown in Table 2 below. 

Tables 1 5 2 and 3 follow. Table 1 shows the various tissue sources of SEQ ID NO: 1-1350. 
Table 2 shows the nearest neighbor result for the assembled contig. The nearest neighbor result 
shows the closest homolog with an identifiable function for each assemblage. Table 3 contains the 
start and stop nucleotides for the translated amino acid sequence for which each assemblage 
encodes. Table 3 also provides a correlation between the amino acid sequences set forth in the 
Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID NO. in 
USSN 09/496,914. 
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TABLE 1 



Tissue Origin 


RNA Source 


Hyseq Library Name 


SEQ ID NOS: 


adult brain 


GIBCO 


AB3001 


111 151 188 215 662-665 877 910 927 
976 1233 1319 


adult brain 


GIBCO 


ABD003 


41 49 74 101 111 120 132 141-142 151 
217 225 238 271 317 404 446 469 503 
513-514 535 550 564 573 666-669 798 
898 910 927 976 1067 1083 1085 1178 
1254 


adult brain 


Clontech 


ABR001 


39 216 238 327 356 535 927 1056 1 121 
1178-1180 1199 1251 


adult brain 


Clontech 


ABR006 


74 611 949 1034 1136 


adult brain 


Clontech 


ABR008 


14 32 41 61 81 86 89 120 132 138 145 
147 188 197 208 225 227-239 250 300- 
303 312 316 32S-331 340 357-362 374 
380 384-391 408 414 446 448 464-467 
483 488 495-496 505 512 521 535 550 
566 571 577 585 590 594 598 634 641 
658 666 683 725 742 764 767 786 801 
805 810 823 826 829 831 836 841 887- 
923 927 934 943 950-95 1 963 976 995 
1000-1001 1006 1026 1034 1048 1057- 
1067 1086 1088 1090 1118 1120 1122- 
1 128 1 142 1 162 1181-1 192 1 199 1204 
1218-1219 1225 1232 1253 1267 1271- 
1306 1342 1347 1349-1350 


adult brain 


Clontech 


ABR011 


49 238 1219 


adult brain 


BioChain 


ABR012 


74 238 


adult brain 


Invitrogen 


ABR013 


868 1268 


adult brain 


Invitrogen 


ABT004 


49 1 17 138 191 217 252 291 305 535 
566 596 663 670 746 798 816-819 876 
892 89S 922 943 963 1034-1036 1 121 


cultured 
preadipocytes 


Strategene 


ADP001 


41 74 101 138 21 1 238 304 537 582 
740 798 883 943 976 1067 


adrenal gland 


Clontech 


ADR002 


49 74 101 111 120 127 151 215 238 
240-247 316 330 363-364 404 414 534- 
535 833 924-940 950 963 976 1001 
1003 1067-1070 1118 1156 1193-1200 

1325 


adult heart 


GIBCO 


AHR001 


38 49 71-72 74-77 79 92 99 101 111 
118 129 132 138 151 158-163 182 195- 
203 215 217 238 264 269 353 384 398 
408 434-439 446 504 512-513 519 537 
562-573 577 611-614 616-619 658 661 
671-672 722 734 757-773 815 828-835 
874 891 898 919 926-927 976 988 
1021 1037 1041 1062 1067 1071 1080 
1083 1093 1122 1131 1185 1201 1254 
1308 1331 1335 


adult kidney 


GIBCO 


AKD001 


41 49 51 71-74 78-85 94 100-101 103- 
107 111 119-120 138 151 157215217- 
21 8 238 250 264 294 304 384 404 440 
446 454 477 504-505 509 514 518-519 
535 537 564 574-583 620-627 639 653 
673-675 705 753 789 831 S44 851 859 
877 909 918 927 956 963 976 1067 
1074 1083 1095 1 178 1302 1331 1335 


adult kidney 


Invitrogen 


AKT002 


11-12 41 49 111-112215-217294 316 
446 487 564 575 844 868 910 927 976 
1116 


adult lung j G1BCO 


ALG001 


8 101 111 151 187 402 446 490 514 
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Tissue Origin 


RNA Source 


Hyseq Library Name 


SEQ ID NOS: 








5 1 8 537 545 549 580 582 592 594 634 
640 651-652 676-678 725 851 873 918 
952 976 1042 1067 1076 108j 1152 


lymph node 
_ 


Clontech 


ALN001 


8 111 121 151 180-182 1S8 215 537 
545 549 651 679-682 789 804-810 868 
6/3 927 952 976 1042 1059 1335 


young liver 




AL,VUU1 


O /LA 1 r% 1 11 1 O/* 1 C 'M / 'MO A A f CI A 

5 64 79 1 1 1 186 215-216 238 446 514 
519 537 564 653 683-684 698 753 798 
813 833 840 858 927 976 1038-1039 
1051 1085 1224 1245 1256 


adult liver 


Invitrogen 


ALV002 


40 71 292-293 305 384 468-469 496 
505 657 675 714 753 832 844 941-942 
976 1040 1076 1256 1293 


adult liver 


Clontech 


ALV003 


976 


adult ovary 


Invitrogen 


AOV001 


8 32 36 38 41 49 51 71 74 79-80 101 
104 1 1 1 120 122-125 138 140 143-149 
151 188-190 207-212 215-217 238 264 
3 1 6 384 409 440 445-446 496 504 5 12 
514 518-519 535 537 549-550 564 566 
571 580 582 600 618 638 657 661 6%\ 
685-697 699 705 722 735-744 761 771 
815 833 842-865 868 875-876 918 926- 
927 950 952 963 976 1023 1042 1048 
1051 1059 1072 1076 1083 1117 1120 
1 124 1 131 1144 1174 1224 3268 133 3 
1335 


adult placenta 


Clontech 


APL001 


102 217 238 537 641 700 


placenta 


Invitrogen 


APL002 


663 851 1048 


adult spleen 


GIBCO 


ASP001 


8 45 74 111 132 140 151 185 217 238 
294 414 446 477 504 5 14 534 545 549 
592 722 873 883 952 976 1041-1042 
1083 1093-1094 1152 1224 


testis 


GIBCO 


ATS001 


72 107 111 113 126 140 151 383 215 
238 446 497 537 642 701-706 81 1 877 
927 962 976 1083 1117 1131 


adult bladder 


Invitrogen 


BLD001 


41 151 191 402-405 409 414 496 545 
592 607 706 873 952 1178 1329-1335 


bone marrow 


Clontech 


BMD001 


8 58-62 65-68 74 79 108 111 116 137 
147 151 164-174 213-215 238 305-307 
374 404 446 460 466 5 16 519 534 538- 
541 544-546 549-554 566 584 586 592 
596 607 610 628-629 643-645 652 707- 
708 774-789 844 866-871 873 919 927 
952 963 976 998 1034 1042 1064 1083 
1085 1120 H32 H52 1225 1229 1268 
1307 1310 


bone marrow 


Clontech 


BMD002 


6 8 37-38 52 74 77 105 111 129 132 
210 317 510-51 1 545 549 581 :>98 628 
638 724 766 789 844 860 868 873 939 
927 952 963 968 976 1042 I 111 H4l 
H60-H61 I229 1266 1346 


bone marrow 


Clontech 


BMD004 


111 238 282 549 1 083 


adult colon 


Invitrogen 


CLN001 


52 260 264 299 494 536 545 564 592 
844 87^> 677 952 976 1042 1152 126S 
1336-1337 


adult cervix 


BioChain 


CVX001 


49 51 129 132 151 205 207 238 332- 
335 365-367 392-401 440 466 470-471 
5 1 8 537 597 629 832 877 927 976 1006 
1085 1117 1129-1134 1192 1202-1205 
1219 1309-1328 


diaphragm 


BioChain 


D1A002 


74 976 1083 
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Tissue Origin 


RNA Source 


Hyseq Library Name 


SDQ ID NOS: 


endothelial cells 


Strategene 


EDT001 


32 40-41 49 74 79 101 111 120 132 
no i <i 204-206 215-° 17 238 269 316 
414 433 505 510 513 550 555 580 582 
596 675 722 745 798 814 836-841 851 
918 976 1041 1043 1073 1083 1131 
1331 


Genomic clones 
from the short arm 
of chromosome 8 


Genomic DNA 
from Genetic 
Research 


EPM001 


525-532 927 


Genomic clones 
from the short arm 
of chromosome 8 


Genomic DNA 
from Genetic 
Research 


EPM003 


47 525 


Genomic clones 
from the short arm 
of chromosome 8 


uenomic ujna 
nrom vjencuu 
Research 


or^iviuoH 


525 927 


Genomic clones 
from the short arm 
of chromosome 8 


Genomic DNA 
from Genetic 
Research 


EPM005 


531 


esophagus 


BioChain 


ESO002 


74 138 238 


fetal brain 


Clontech 


FBR001 


441-442 927 


fetal brain 


Clontech 


FBR004 


7 1 S £Cn Q77 1 DDI 

ZlJ OiO 1 1\J\J1 


fetal brain 


Clontech 


rBKOUo 


Afl £1 101 170 1^2 1^8 140 147 208 
225 271 317 319 336 359 368 405-414 
519 550 571 594 686 715 722 764 824 
S7Q RSO Q09 927 943 947 963 10^7 
1067-106S 1104 1135-1140 1162 1206- 
1207 1235 1268 128S 1307-1308 1319 
1338-1350 


fetal brain 


Clontccn 


r jdxvsuj 


111 446 


fetal brain 


Invitrogen 




4151 120 151 192-194 264 504 512 
535 683 761 798 820-827 844 876 909 
963 976 1026 1048 1083 1 144 1302 


fetal heart 


invitrogen 


FRR001 


446 566 761 


fetal kidney 


Clontech 


FKJD001 


51 74 111 127 140 151 184 294 537 
550 630-631 1319 


fetal kidney 


Clontech 


FKD002 


111976 1083 


fetal kidney 


Invitrogen 


FKD007 


238 974 


fetal lung 


Clontech 


FLG001 


463 566 976 1074 1083 1093 


fetal lung 


Invitrogen 


FLG003 


41 238 330 407 415-416 537 573 844 
859 1048 1083 1116 1192 


fetal liver-spleen 


Columbia 
University 


FLS001 


8 14 34-35 37 41 43 49 51 54-56 63-64 
69-73 74 77 79 87-90 101 107 /10-111 
114 120 128-131 138 140 147 150-155 
197 210 215 217 225 238 312 367 384 
414 440 446 460 468 483 496 504-507 
511-515 518-519 523 533-535 537 541 
544-545 547-550 555-560 564 566 571 
577 582 585-586 598 636 646-647 649 
£^7 AAA 6QR 700-710 714 797-723 731 
735-736 746-753 761 784 798 823 829 
832 844 851 858-859 868 873 876 898 
927 943 949 952 963 976 984 1002 
1021 1023 1040 1042 1044 1050 1083 
1093 1116 1120 1129 1131 1144 1174 
1217 1251 1254 1256 1302 1308 1311 
1319 


fetal liver-spleen 


Columbia 
University 


FLS002 


8 36-37 41-46 49 54 64 71 74 79 101 
111 120 129 147 207 210 215-216 238 
250 330 353 359 366 383-384 414 478 
505 508-509 511 515-524 534-535 537 
544-545 564 566 571 577 591 598 638 
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Tissue Origin 


RNA Source 


Hyseq Library Name 


SEQ ID NOS: 








663 671 698 714 722 725 727 751 798 
851 S59 873 876 909 927 949 952 983- 
984 1002 102^ 1042-1044 1085 1095 
IIjI 1 144 1 1/5 1 199 1233 1240- 1270 
1331 1340 


fetal liver-spleen 
— - — — 


Columbia 
University 


FLS003 


64 535 976 1256 


fetal liver 


In vitro gen 


ni \7f\f\i 

rl_,VU(Jl 


8 101 120 138 217 446 468 53d 566 
580 722 730 749 844 918 943 976 1051 
1256 1331 


fetal liver 


Clontech 


FLV004 


537 926 1256 


fetal muscle 
— — — 


Invitrogen 


FMS001 


51 111 264 312 369-370 404 417-421 
425 535 537 577 598 614 836 857 1 141 
1208 1268 


fetal muscle 


Invitrogen 


hMS002 


537 


fetal skin 


Invitrogen 


T7C XT' c\ rv 1 

FSK001 


13-26 32 41 51 89 107 1 1 1 147 151 
225 264 316 405 422-429 488-494 496 
519 534-535 537 566 675 732 859 876- 
877 898 947 949-950 963 976 1001 
1062 1076 1083 1117 1144 1165 1268 
1281 


fetal skin 


Invitrogen 


FSK002 


537 812 


fetal spleen 


BioChain 


FSP001 


87 549 


umbilical cord 


BioChain 


FUC001 


27-33 41 49 151 215 238 248-249 301 
316 446 495-503 519 521 534-535 537 
582 634 691 877 883 927 944-950 963 
976 1001 1075 1142-1143 1171 1218 
1243 1308 


fetal brain 


GIBCO 


HFB001 


41 49 57 79 87 103 1 1 1 120 132-135 
138 145 151 188 197 207 215 238 264 
271 294 316 367 434 440 446 466 504 
513-514 535 542-543 550 564 571 596 
635 648-654 675 71 1-715 722-723 798 
832 872 876 883 927 976 1095 1 144 
1168 1171 1178 1211 1335 


macrophage 


Invitrogen 


HMP001 


238 


infant brain 


Columbia 
University 


IB2002 


49-50 77 81 89 105 111 136-138 140 
151 161 175-179 185 216-217 264 295 
299 308-310 373-373 462 476 504 51 1- 
513 533 537 564 566 571 655-657 662 
683 716-720 723 752 790-803 829 832 
858-859 876 898 909 949 976 1045- 
1047 1076-1087 1090 1093 1116 1122 
1144 1209-1213 1225 1233 1256 1319 
1341 


. : 

infant brain 


Columbia 
University 


lt>2uu3 


41 50 77 104 132 215 238 508 512-513 
519 566 655 714 794 918 943 976 1067 
1092-1093 1233 


infant brain 


Columbia 
University 


IBM002 


311 472-473 753 1214 


infant brain 


Columbia 
University 


IBS001 


51 111 376 474 790 876 949 1144 1204 
1221 


lung , fibroblast 


Strategene 


LFB001 


151 316 462 514 534 5S2 675 939 1131 


lung tumor 


Invitrogen 


LGT002 

1 


1-7 41 74 79 94 115 120 138-139 156 
215 217 269 2S0 296 337 374-375 384 
404 446 454 475-480 498 5 1 4 5 1 8-5 3 9 
522 537 545 564 577 597 653 658 705 
721-724 754-756 779 859 868 872-874 
876-877 919 927 949 951-952 959 976 
. 1002 1042 1048-1053 1076 1083 1088- 
1089 1131 1144-1147 1216-1218 1229 
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Tissue Origin 


RNA Source 


Hyscq Library Name 


SEQ ID NOS: 








1293 1311 


lymphocytes 


ATCC 


LPC001 


41 74 111 132 151 253 316 446 550 
634 844 927 976 1085 1268 


leukocyte 


GIBCO 


LUC001 


S 11 41 74 S6 91-98 101 109 111 120 
147 151 212 215 218 238 252 288 312- 
314 316 338 359 408 427 443-447 505 
510 512 514 518 534 545 549-550 561 
564 566 571 577 580 582 587-609 615 
632-638 658-659 698 714 725-728 832 
836 841 859 866 873-874 882-883 918- 
919 927 943 952 963 976 1042 1076 
1083 1090 1148 1152 lloo 119!) 1219- 
1220 1224 


leukocyte 


Clontech 


LUC003 


74 100 215 232 238 339-341 446 545 
657 660 729 873 883 927 952 963 1008 
1042 1116 1120 1149-1150 1215 1222 


Melanoma from cell 
line ATCC #CRL 
1424 


Clontech ! 


MEL004 


210 215 238 342 534 545 592 722 873 
919 929 939 952 976 1071 1118 1218 
1235 1245 


mammary gland 


Invitrogen 


MMG001 


8-10 40-41 49 73 80 114 138-140 147 
217 250-256 264 297-299 305 377-378 
398 446 481-486 505 512 537 545 549 
571 592 725 730-733 816 829 836 844 
86S 873 876-877 898 926 943 951-960 
963 976 995 1034 1042 1048 1054- 

i r\ c e i r\i£. 1 AO 1 A0 1 1 AAO 1 1 1 /C 111'? 

105^ 1076 1083 1091 1U93 1110-111/ 
1 124 1 152 13U2 


induced neuron cells 


Strategene 


NTD001 


39 101 111 138238 361 1225 1251 
1319 


retinoid acid induced 
neuronal cells 


Strategene 


NTR001 


74 225 976 


neuronal cells 


Strategene 


NTU001 


129 225 238 304 313 361 657 976 


pituitary gland 


Clontech 


PIT004 


976 


placenta 


Clontech 


PLA003 


38 976 1 


prostate 


Clontech 


PRT001 


1 1 1 188 238 257-258 564 724 961-966 
1067 1095 


rectum 


Invitrogen 


REC001 


238 430-431 841 859 868 963 1001 
1116 


salivary gland 


Clontech 


SAL001 


8 151 402 432-433 446 496 868 952 
976 1083 1120 1151 1184 


small intestine 


Clontech 


SIN001 


8 101 147 215 259-266 446 462 505 
545 592 660 789 836 866 873 927 952 
963 967-978 1042 1120 11j2 12z3- 
1224 


skeletal muscle 


Clontech 


SKM001 


238 302 927 943 992 1U31 


spinal cord 


Clontech 


SPC001 


74 111 132 151215-216 238 264 267- 
270 343-344 353 o /9 j Id jo / joo /4U 
828 927 976 979-994 1092 1153-1159 
1225 1250 


adult spleen 


Clontech 


SPLcOl 


698 859 1042 


stomach 


Clontech 


STO001 


210 238 271-272 537 580 705 918 952 
995 1171 


thalamus 


Clontech 


THA002 


61 219-220 273-276 312 315 330 596 
qaq QQ6-1007 1059 1093 1160-1162 


thymus 


Clonetech 


THM001 


8 120 151 208 221 316-317 353 639 
750 867 874 878-881 927 963 1023 
1083 1094-1096 1124 


thymus 


Clontech 


THMc02 


8 61 114 129 132 210 225 231 306 
317-319 336 340 359 380 398 446448- 
463 512 519 545 554 587 598 698 724- 
725 789 812 836 868 873 927 947 952 
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Tissue Origin 


RNA Source 


Hyseq Library Name 


SEQ IDNOS: 








976 1007 1042 1083 10S5 1097-1116 
1122 1147 1177 1226-1229 1234 1311 
1313 


thyroid gland 


Clontech 


THR001 


14 41 49 76 94 111 144 151 183 188 
2 1 0 2 1 7 222 253 264 27 1 277-286 294 
320-326 345-352 361 3S1-3S2 446 467 
483 514 534 549-550 564 578 602 649 
844 882-883 927 950 956 976 1008- 
1028 1076 1083 1 1 17-1 120 1 142 1163- 
1175 1230-1238 1308 


trachea 


Clontech 


TRC001 


223-225 238 287 353-354 514 
545 592 61 1 873 883-884 927 
952 1029-1031 1042 1151-1152 
1170 1176-1177 1239 


uterus 


Clontech 


UTR001 


151 226 288-290 355 537 877 
885-886 976 1001 1032-1033 
1232 



TABLE 2 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
W ate i man 
Score 


% 

Identity 


1 


B02829 


Homo sapiens 


Human G protein coupled receptor hRUP5 
protein SEQ ID NO: 10. 


460 


100 


2 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7645. 


1 1 1 


51 


3 


R26173 


Homo sapiens 


Part of Maior Yo naran^onla^tif antitren 
(CDR62) encoded by clone pY2. 


293 


76 


4 


L29536 


Homo sapiens 


calcium channel L-type alpha 2 subunit 


191 


65 


5 


Y94943 


Homo sapiens 


Human secreted protein clone yt!4J protein 
sequence SEQ ID NO:92. 


251 


50 


6 


Ml 1507 


Homo sapiens 


transferrin receptor 


120 


95 


7 


AF099100 


Homo sapiens 


WD-repeat protein 6 


1941 


93 


8 


V92338 


Homo sapiens 


Human cancer associated antigen precursor from 
clone NY-REN-45. 


245 


82 


9 


GO 1343 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5424. 


226 


91 


10 


AJ133798 


Homo sapiens 


copine VII protein 


1127 


68 


11 


G02449 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6530. 


584 


99 


12 


X98330 


Homo sapiens 


ryanodine receptor 2 


282 


78 


13 


AL024498 


Homo sapiens 


d/417M14.2 (novel serine/threonine-protein 
kinase (ortholog of mouse and rat MAK (male 
germ cell-associated kinase)) 


293 


100 


14 


AF045577 


Pan 

troglodytes 


olfactory receptor OR93Ch 


191 


36 


15 


G03131 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7212. 


93 


39 


16 


U26595 


Rattus 
norvegicus 


prostaglandin F2a receptor regulatory protein 
precursor 


569 


89 


17 


B08918 


Homo sapiens 


Human secreted protein sequence encoded bv 
gene28SEQIDNO:75. 


99 


44 


18 


Y36203 


Homo sapiens 


Human secreted protein #75. 


165 


75 


19 


U15647 


Mus 

musculus 


reverse transcriptase 


106 


40 


20 


G02701 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6782. 


544 


100 


21 


Y35923 


Homo sapiens 


Extended human secreted protein sequence, SEQ 
ID NO. 172. 


1691 


100 


22 


G04030 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8111. 


380 


96 


23 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6536. 


123 


50 


24 


AF036329 


Homo sapiens 


gonadotropin-releasing hormone precursor, 
second form 


284 


90 


25 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8148. 


96 


32 


26 


S80119 


Rattus sp. 


reverse transcriptase homolog 


100 


34 


27 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


101 


35 


28 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7348. 


135 


45 
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SEQ 
ID 


Accession 

JNO. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


29 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8148. 


83 


42 


30 


i L 


tlOmO SupiCIli> 


Human secreted nrotein SEO ID NO: 6953. 


116 


72 


31 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7452. 


96 i 


67 


32 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7305. 


58 


32 


33 


Y 66688 


Homo sapiens 




2457 


98 


34 


Y87071 


Homo sapiens 


Human secreted protein sequence SEQ ID 
xirvi in 

INU, 1 lu. 


348 


95 


35 


U15131 


Homo sapiens 


pl26 


182 


48 


36 


Y73464 


Homo sapiens 


Human secretea protein cione yi*+^_i pruLcui 
sequence ocjv^ iu inu.uw. 


982 


90 


37 


AL133215 


. 

Homo sapiens 


ka 1 OCT *7 ft ( <zt*m'*r\\\i\T\n AC\ f «;ema domain 

immiinnolnhnliri rlomain fTp^l transmembrane 
llllillUIlUJdlUUUllll UV/HlfiiiJ imuoiiiviiiwiuiiw 

Hnmain fTI^/fl and <?hort cvtonlasmic domain)) 


687 


99 


38 


AC067969 


amino acids 


Homo sapiens ryanodine receptor 1 (skeletal) 


386 


66 


39 


AL031588 


Homo sapiens 


dJl 163J1.1 (mostly supported by GENSCAN, 

FnF"WFS and flFNFWTSE^ 


493 


76 


40 


GUiozo 


— — — 

Homo sapiens 


Hnman secreted nrotein SEO ID NO' 7709. 


110 


51 


41 


AJr 


Homo sapiens 




228 


68 


42 


Y36268 


Homo sapiens 


Hums*!-! cprrptpH nrntein encoded bv CT ene 45. 


220 


88 


43 


X61048 


Hydra sp. 


mini-collagen 


105 


35 


44 


M76546 


He)ianthus 
annuus 


nyaroxypr t/iiiic-r icii piuLciu 


110 


31 


45 


US2288 


Caenorhabditi 
s eiegans 


Rac-like GTPase 


139 


70 


46 


G03477 


Homo sapiens 


Wnmon c<>rreteH nrntein SFO ID NO' 7558 


118 


58 


47 


AF09U942 


Homo sapiens 




113 


63 


48 


G03564 


Homo sapiens 


u lini! , n cerret^ri nrntein SFO ID NO' 7645 

fj. Li III till oCi/rdCU JJI ULt-ll 1, OLy *w-r„'. 


90 


59 


49 


AJ005560 


Mus 

musculus 


SPR2B protein 


72 


56 


50 


G02450 


Homo sapiens 


Human cprretpH nrntein SEO ID NO' 6531 


385 


98 


51 


Y9164V 


Homo sapiens 


T-lum-ar* c^rrft^ri nrntein ^ennence encoded bv 

gene 60 SEQ IDNO:322. 


973 


94 


52 


U93563 


Homo sapiens 


puLdllVC pi JU 


105 


38 


53 


Y55927 


Homo sapiens 


Human STLK2 protein. 


699 


85 


54 


G02607 


Homo sapiens 


T4iitt»c»m c^prr*t^d nrntein SFO ID NO' 6688 


145 


56 


55 


AB00S175 


Mus 

musculus 


nepatic nucjedr lacior i-oclci aiiuiL lunu 


356 


74 


56 


M68941 


Homo sapiens 


proiem-Lj'robinc piiupn<ii.ttoc 


165 


41 


57 


AL0316UU 


Homo sapiens 


c^yuxio.i ^ciiioriuc uiicuiiici / j 


338 


76 


58 


AF0U417 


Mus 

musculus 


pULallVe pilCI UI11U11C ICL-CpiUl 


143 


55 


59 


AJF 167320 


Mus 

musculus 


zinc linger protein ^rr iu 


558 


68 


60 


U73036 


Homo sapiens 


interferon regultory factor 7 


263 


"96 


61 


X07984 


Mus 

musL-u iub 


proiein-iyruoinc Kinase 


297 


69 


62 


Y29861 


Homo sapiens 


Human secreted protein clone cb98_4. 


791 


98 


63 


U35376 


Homo sapiens 


repressor irdiibL/i lpnuiiai wviui 


485 


65 


64 


AF265:o5 


Homo sapiens 


UDlUUIllll WU11J U^d.1111^ i_> 1 IX til 1 1 Ldliijrill^ 

A POT T ON 


785 


74 


65 


GU3ob3 


- — ~-r : 

Homo sapiens 


T-Tnman secreted nrotein SEO ID NO: 7964. 


88 


95 


66 


AF 177390 


Manduca 
sexta 


antennal specific membrane protein AMP 


274 


54 


67 


AB040800 


Homo sapiens 


SREB2 


614 


100 


68 


AF030027 


Equine 
herpesvirus 4 


7A 


213 


26 


69 


G02965 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7046. 


261 


95 


70 


W75770 


Homo sapiens 


Human oxidoreductase YTF03. 


1144 


98 


71 


AB011135 


Homo sapiens 


KIAA0563 protein 


239 


76 


72 


AB014885 


Halocynthia 
roretzi 


HrPOPK-1 


813 


78 


73 


AE045454 


Cavia 
porcellus 


phospholipase B 


955 


73 


74 


J02870 


Mus 


laminin receptor 


308 


61 



105 



^SDOCID: <WO 0157188A2_I_> 



WO 01/57188 




CT/USO 1/03800 



SEQ 

ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 






musculus 








75 


Y00S26 


Rattus 
norvegicus 


gp210(AA 1-1886) 


413 


84 


76 


AF 117754 


Homo sapiens 


thyroid hormone receptor-associated protein 
complex component TRAP240 


35 1 


54 


77 


Y38422 


Homo sapiens 


Human secreted protein. 


46S 


76 


78 


Y14596 


Homo sapiens 


Human T-type voltage-gated Ca channel alpha- 
l -1 (hCavT3^ 


1357 


99 


79 


VI 459 1 


Human 
papillomaviru 
s type 68 


APM-1 protein 


767 


100 


80 


AL 137802 


Homo sapiens 


dJ798A10.2 (K1AA0445 protein) 


71 


34 


81 


AP000383 


Arabidopsis 
thaliana 


protein arginine N-methyltransferase-like protein 


359 


65 


82 


L46S15 


Mus 

musculus 


DNA binding protein Rc 


895 


75 


83 


GO 1600 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5681. 


315 


96 


84 


Y53SS6 


Homo sapiens 


A sunoressor of cvtokine signal line protein 
designated HS COP-6. 


538 


71 


S5 


AB029002 


Homo sapiens 


K1AA1079 protein 


134 


42 


86 


Y28678 


Homo sapiens 


Human cw272 7 secreted protein. 


325 


62 


87 


Y99368 


Homo sapiens 


Human PRO 1326 (UNQ686) amino acid 
sequence SEQ ID NO: 100. 


156 


48 


88 


AJ225124 


Mus 

musculus 


hvperpolarizati on-activated cation channel, 
HAC3 


487 


95 


89 


AF 177203 


Homo sapiens 


cerebral cell adhesion molecule 


290 


56 


90 


Y2S280 


Homo sapiens 


Human G-protein coupled receptor GRJR-2. 


326 


79 


91 


L39891 


Homo sapiens 


polycystic kidney disease- associated protein 


1751 


95 


92 


AF064876 


Homo sapiens 


ion channel BCNG-1 


953 


99 


93 


AF 170723 


Homo sapiens 


protein kinase STK10 


401 


53 


94 


X13292 


Trypanosoma 
brucei 


GPl-DhosDholiDase C (AA 1 - 358^ 


151 


37 


95 


Y34127 


Homo sapiens 


Human potassium channel K+Hnovll. 


661 


99 


96 


X03638 


Rattus 
norvegicus 


sodium channel nrotein 1 (aa 1-2009^ 


1775 


92 


97 


AF134213 


Homo sapiens 


ubiquitin-speciflc protease 


1995 


99 


9S 


GO0838 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4919. 


213 


38 


99 


AF021935 


Rattus 
norvegicus 


u i_y luii \iy on \j \ji ly iviiia.oc-iciu.LCLi VjUL/t*. uiiivllilj^ 

kinase 


\j i j 


4R 


100 


AF279265 


Homo sapiens 


putative anion transporter 1 


867 


98 


101 


AC007878 


Homn ^ani^n*; 


march to nuclpar nrAfrfin TsTP'?9ft' n rvtp ■ Q^mifnr'p 
difference at residue S8 


160 


ou 


102 


U22829 


Mus 

musculus 


P2Y purinoceptor 


264 


42 


103 


Y45023 


Homo sapiens 


Human sensorv transduction G-protein coupled 
receptor-B3. 


516 


99 


104 


Y94990 


Homo sapiens 


Human secreted protein vb21__l, SEQ ID NO:20. 


787 


98 


105 


Y87342 


Homo sapiens 


Human signal peptide containing protein HSPP- 
119SEQlDNO:119. 


343 


57 


106 


AF169312 


Homo sapiens 


hepatic angiopoietin-related protein 


212 


67 


107 


AF1 16657 


Homo sapiens 


PRO 13 10 


74 


52 


108 


AE000401 


Escherichia 
coli 


sialic acid transporter 


587 


96 


109 


Y38395 


Homo sapiens 


Human secreted protein encoded by gene No. 10. 


693 


100 


110 


Y78801 


Homo sapiens 


Hydrophobic domain containing protein clone 
HP00631 amino acid sequence. 


182 


94 


111 


Z25535 


Homo sapiens 


nuclear pore complex protein hnupl53 


464 


85 


112 


Y94939 


Homo sapiens 


Human secreted protein clone ye90_l protein 
sequence SEQ ID NO:84. 


274 


51 


113 


AFO 16365 


Homo sapiens 


hexokinase 1 isoform td 


301 


71 


114 


AC007956 


Homo sapiens 


unknown 


520 


75 


115 


M83738 


Homo sapiens 


protein-tyrosine phosphatase 


251 


92 


116 


AL 157952 


Homo sapiens 


dJ875K15.1.1 (ets homologous factor (ets- 
domain transcription factor ESE-3A, isofi rm 1)) 


484 


91 


117 


W 18084 


Homo sapiens 


Human Aurora-2. 


546 


87 



106 



WO01/57188 «-« PCT/US01/U3800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 
cam kinase 1 


Smith- 

VN alCs.LiiJ.aii 

Score 
407 


% 

Tflpntitv 
62 


118 
119 


L41816 
AJ006710 


Homo sapiens 

Rattus 

norvegicus 


phosphatidylinositol 3 -kinase 


627 


93 


120 


AF026954 


Bos taurus 


pyruvate dehydrogenase phosphatase regulatory' 
subunit precursor, PDPr 


1646 


94 


121 


S3 93 92 


Homo sapiens 


protein tyrosine phosphatase, PTPase {EC 
3.1.3.48} 


373 


68 


122 


U60805 


Homo sapiens 


oncostatin-M specific receptor beta subunit 
Human truncated tankyrase-1. 


262 
111 


88 
35 


123 
124 


Y44403 
U8S167 


Homo sapiens 
Caenorhabditi 
s elegans 


contains similarity to C2 domains 


219 


29 


125 


AF300648 


Homo sapiens 


guanine nucleotide binding protein beta subunit 
4 


693 


90 


126 


AB021861 


Mus 

musculus 


apoptosis signal-regulating kinase 2 


153 


65 


127 
128 


AF305210 
M90360 


Homo sapiens 
Homo sapiens 


concentrative JNa+- nucieosiue coiranspoiici 

hCNT3 

protein kinase 

alpha 1C adrenergic receptor isoform 2 


807 

220 
574 


97 

73 
86 


129 
130 
131 


D32202 

AF208043 

AF201734 


Homo sapiens 
Homo sapiens 
Mus 

musculus 


IFI16b 

testis specific serine kinase-3 

differentiation enhancing factor 1 j_ 


496 
800 

159 


67 
87 

74 


132 
133 


AF 112886 
AJ278314 


Bos taurus 
Homo sapiens 


phospholipase C-beta-lb 


554 


85 


134 
135 


W74802 
AB020335 


Homo sapiens 
Homo sapiens 


Human secreted protein encoded by gene 73 
clone HSQEL25. J 
Pancreas-specific gene 


1 157 
668 


87 
96 


136 | 
137 


W80408 
AC002563 


Homo sapiens 
Homo sapiens 


A secreted protein encoded by clone dt674_2. 
putative RHO/RAC effector protein; 95% 
similarity to r4y2U3 {riu.gli^Doov) 
PR03434, a novel secreted protein. 


866 

Jut 1 

891 


98 
99 

100 


138 
139 


Y96736 
AB024034 


Homo sapiens 

Arabidopsis 

thaliana 


DNA-damage inducible protein DDI 1 -like 


147 

It/ 


55 


140 


W97809 


Homo sapiens 


Human GTPase regulator GRAF. 


248 
125 


56 
46 


141 
142 


Y51557 
AF090113 


Homo sapiens 

Rattus 

norvegicus 


Human PLA2 protein. 

AMPA receptor binding protein 


623 


93 


143 


W26642 


Homo sapiens 


Human RECK cancer-inhibiting protein. 


641 


82 


144 


U87306 


Rattus 
norvegicus 


transmembrane receptor UNC5H2 


578 


84 


145 


AF264014 


Homo sapiens 


scavenger receptor cysteine-rich type 1 protein 
Ml 60 precursor 


727 


92 


146 


W63683 


Homo sapiens 


Human secreted protein 3. 

galactose- 1 -phosphate uridyl transferase 


140 
513 


40 
81 


147 
148 


M96264 
D64014 


Homo sapiens 

Escherichia 

coli 


I*rsA 


818 


90 


149 


M83316 


Escherichia 
coli 


pppGpp phosphohydrolase 


915 


95 


150 


AL 163279 


Homo sapiens 


nomolog to cajvlt response eiemcnL uuiuui-, <mu 
beta transducin familv proteins 


1261 


99 


151 


AF 179867 


Homo sapiens 


STE20-like kinase 


940 


99 


152 


R95332 


Homo sapiens 


Tumor necrosis tactor receptor 1 death domain 
ligand (clone 3TW). 


392 
370 


61 

92 


153 
154 


AF151859 
X66957 


Homo sapiens 
Homo sapiens 


CG1-101 protein 
hexokinase type 1 


489 


81 


155 


Y16355 


Homo sapiens 


alternatively spliced form 


432 


92 


l:>o 


Uuv/oj / 


Wfkmn <?finiens 


Human secreted protein, SEQ ID NO: 4938. 


349 


78 


157 


AF 159455 


Mus 

musculus 


zinc finger protein 

interleukin-1 receptor-associated kinase 


352 
537 


74 
76 


158 
159 


L76191 
AP001743 


Homo sapiens 
Homo sapiens 


putative gene, ankirin like, possible dual 
specifity'Ser/Thr/Tyr kinase domain 


670 


98 


160 
161 


AJ250425 
G02885 


Rattus 
non 2gicus 
Homo sapiens 


Collybistin I 
" Human secreted protein, SEQ ID NO: 6966. 


556 
370 


74 
100 



107 



MSDOCID- <WO 0157188A2_I_> 




T/US0 1/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


162 


Z2296S 


Homo sapiens 


Ml 30 antigen 


610 


100 


163 


AF181121 


Homo sapiens 


ATP-dependent Ca2+ pump PMR1 


336 


92 


164 


AF055636 


Homo sapiens 


leucine-rich glioma-inactivated protein precursor 


455 


94 


165 


AF 160798 


Rattus 
norvegicus 


calcium transporter CaTl 


700 


96 


166 


Y76332 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 38. 


327 


45 


167 


Y48607 


Homo sapiens 


Human breast tumour-associated protein 68. 


1072 


99 


168 


AB020741 


Mus 

musculus 


NIK-related kinase 


197 


43 


169 


AF252293 


Homo sapiens 


PAR3 


596 


44 


170 


U59429 


Cricetinac 
gen. sp. 


diacylglycerol kinase eta 


481 


82 


171 


AF035268 


Homo sapiens 


phosphatidylserine-specific phospholipase Al 


386 


42 


172 


AF 12708 5 


Mus 

musculus 


semaphorin cytoplasmic domain-associated 
protein 3B 


507 


82 


173 


Y27918 


Homo sapiens 


Human secreted protein encoded by gene No. 
123. 


653 


99 


174 


G02979 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7060. 


538 


97 


175 


U36488 


Mus 

musculus 


emhrvonic stem cell nhosohatase 


168 


55 


176 


W95629 


Homo saDiens 


Homo sapiens secreted protein gene clone 
gml96 4. 


1022 


100 


177 


AF2S9023 


Homo sapiens 


formiminotransferase cyclodeaminase form D 


255 


93 


178 


X04936 


Homo sapiens 


T-cell receptor alpha-chain (413 is 2nd base in 
codon) 


710 


99 


179 


AF 127481 


Homo sapiens 


non-ocogenic Rho GTPase-specific GTP 
exchange factor 


175 


80 


180 


G00978 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5059. 


517 


94 


181 


Y66645 


Homo sapiens . 


Membrane-bound protein PRO 13 10. 


671 


96 


182 


AF1 10640 


Homo sapiens 


orphan seven- transmembrane receptor 


862 


100 


183 


AB020854 


Bos taurus 


orphan transporter short splicing variant 


766 


84 


184 


AF 169691 


Homo sapiens 


cadherin-like protein VRS 


375 


38 


185 


AF126372 


Homo sapiens 


thyrotropin-releasing hormone degrading 
ectoenzyme 


985 


99 


186 


L20966 


Homo sapiens 


phosphod i esterase 


541 


76 


187 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7001. 


254 


93 


188 


Y94918 


Homo sapiens 


Human secreted protein clone dd504_18 protein 
sequence SEQ ID NO: 42. 


301 


98 


189 


Y66713 | 


Homo sapiens 


Membrane-bound protein PRO 1309. 


694 


100 


190 


G03244 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7325. 


331 


73 


191 


U36771 


Rattus 
norvegicus 


^n-olvcerol ^-nhosnhatf* acvltran^tVra^f* 


707 


92 


192 


R05935 


Homo sapiens 


Secreted GPIIb subunit of multiple subunit 
polypeptide (MSP)GPIIb-IIIa. 


157 


72 


193 


M92084 i 


Theilcria 
parva 


casein kinase II alpha subunit 


364 


50 


194 


Y66645 


Homo sapiens 


Membrane-bound protein PRO1310. 


448 


90 


195 


W95631 


Homo <;anien^ 


Homo sanien<; <;ecr£*ted nrotein penp rlonp 

hj968 2. 


382 


49 


196 


AF255614 


Rattus 
norvegicus 


scaffolding Drotein SLIPR 


680 


99 


197 


AC021640 


Arabidopsis 
thai i ana 


putative phosphatidate phosphohydrolase 


300 


41 


198 


AF073967 


Mus 

musculus 
domesticus 


olfactory receptor 


316 


43 


199 


WO 1730 


Homo sapiens 


Human G-protein receptor HPRAJ70. 


617 


98 


200 


AF 117948 


Homo sapiens 


pancreas-enriched phospholipase C 


625 


89 


201 


AF 128625 


Homo sapiens 


CDC42-binding protein kinase beta 


636 


94 


202 


AF 11 7946 


Homo sapiens 


Link guanine nucleotide exchange factor II 


1303 


100 


203 


Y53021 


Homo sapiens 


Human secreted protein clone qc646_l protein 
sequence SEQ ID NO:48. 


701 


99 


204 


AF22796S 


Homo sapiens 


SH2-B beta signaling protein 


182 


79 


205 


S81752 


Homo sapiens 


DPH2L=candidate tumor suppressor gene 


375 


100 



108 



3DOCID: <WO 0157188A2_I_> 



WO01/571KS 



PCT/US01/03800 



SEQ 
ID 

NO: 

206 


Accession 
No. 

U18315 


Species 1 
Sus scrofa 


Description 

{ ovarian cancer critical region of deletion} 
parathvroid receptor 


Smith- 
Waterman 
i>corc 

122 

1 fyf 


% 

Identity 

60 
96 


207 
208 
209 
210 


AF255342 
S52051 
W63683 
D79992 


Homo sapiens 
Rattus sp. 
Homo sapiens 
Homo sapiens 


putative pheromone receptor v 1KL1 long torm 

neurotransmitter transporter 

Human secreted protein 3. 

similar to Drosophila photoreceptor cell-specific 

protein, calphotin. 


715 
840 
541 

1348 


94 
99 

o— 

99 


211 

212 


AF 117948 
U8I035 


Homo sapiens 

Rattus 

norvegicus 


pancreas-enriched phospholipase C 
ankyrin hinding ceil adhesion molecule 
neurofascin 


471 
798 


69 
56 


213 
214 


AF 154846 
AF 102777 


Homo sapiens 
Mus 

musculus 1 


zinc finger protein 

FYVE finger-containing phosphoinositide kinase 
putative gene containing transmembrane domain 


933 


93 
RQ 


215 
216 

217 


AL 163303 
U26595 

G04095 


Homo sapiens 
Rattus 
norvoji,n-ub 
Homo sapiens 


prostaglandin F2a receptor regulatory protein 
precursor 

Human secreted protein, SEQ ID NO: 8176. 


563 

644 
314 


78 

98 
81 


218 
219 
220 


X75756 
Y66723 
D88577 


Homo sapiens 
Homo sapiens 
Mus 

musculus 


protein kinase C mu 
Membrane-bound protein PRO 1 100. 
Kupffer cell receptor 


770 
567 

853 


70 

40 
100 


221 

222 


AF258465 
AF021935 


Homo sapiens 

Rattus 

norvegicus 


OTRPC4 

mytonic dystrophy kinase-related Cdc42-binding 
kinase 


636 


96 


223 


AL136527 


Homo sapiens 


DA215B13.1 (A kinase (PRKA) anchor protein 
1 1 \ 

1 1 ) 

WNT receptor Fnzzled-4 


693 
690 


100 
99 


224 
225 


AB032417 
AF030430 


Homo sapiens 
Mus 

musculus 


semaphorin Via 


703 


68 

ICk 


226 


AE000218 


Escherichia 
coli 


putative dihydroxyacetone kinase (EC 2.7.1.2) 
phosphoinositol 3 -phosphate- binding protein-2 


297 
2080 


5y 
100 


227 
228 


AF302150 
AB024573 


Homo sapiens 
Mus 

musculus 


G TP-binding like protein 2 


265 


88 


229 
230 


AF 122924 
G03205 


Xenopus 
laevis 

Homo sapiens 


Wnt inhibitory factor- 1 

Human secreted protein, SEQ ID NO: 7286. 
M-phase phosphoprotein 1 1 


316 

229 
265 


100 
92 


231 

232 
233 

234 


X98260 
R92754 
R75111 

W69431 


Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 


Human growth differentiation factor- 12. 
Glycosyl-phosphatidylinositol-specific 

n h n cr» h o 1 i nn ^e-D 

Human secreted protein cw 1233__3. 
cprinp nalmitnvltransferase subunit II 


682 
290 

235 
859 


95 
100 

97 
81 


235 
236 
237 


Y0S686 
AF 118275 
X81466 


Homo sapiens 
Homo sapiens 
Mus 

musculus 


atrophin-related protein ARP 


117 
460 


37 
62 


238 


U64857 


Caenorhabditi 
s elegans 


similar to the BPTI/Kunitz family of inhibitors; 
most similar to tissue factor pathway inhibitor 
precursor (TFPI) 


284 


33 


239 


AJ250840 


Mus 

musculus 


o/»rirn=*7thrf»r»ninp nrotein kinase 


739 


63 


240 


AJ223472 


Mus 

musculus 


transcription elongation factor TFlIS.h 


222 


38 


241 


Y94906 


Homo sapiens 


Human secreted protein clone rb649_3 protein 
sequence SEQ DO NO: 18. 
Na+/su1fate cotransporter SUT-1 


353 
591 


52 
"~99 


242 
243 


AF169301 
L22022 


Homo sapiens 

Rattus 

norvegicus 


orphan transporter v7-3 


667 


93 


244 

245 
246 
247 
248 
249 


AF016191 

AF097366 
Y29868 
AF 180475 
Y 17227 
AF250910 


Rattus 
norvegicus 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sap'^ns 
Manduca 


potassium channel 

cone sodium-calcium potassium exchanger 
Human secreted protein clone pp325 9. 
Not4-Np 

Human secreted protein (clone yal-1). 
death-associated small cytoplasmic leucine-rich 


1043 

645 
497 
188 
690 
182 


98 

98 
98 
83 
99 
31 



109 
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omun- 


/o 


ID 


No. 






W otp rm ill 


lucnu ly 


NO: 








Score 








sexta 


protein SCLP 






250 


AF 192756 


Kaposi's 


Orf73 


134 


34 






Jul vVIUCl 












associated 












herpesvirus 








251 


AB022694 


Homo sapiens 


MOK protein kinase 


209 


S3 


252 


W55045 


Homo sapiens 


Neural adhesion molecule (cthb0018f2 product). 


469 


100 


253 


L46815 


Mus 


DNA binding protein Rc 


251 


67 






musculus 








254 


W68505 


Homo sapiens 


Human acid sensing ionic channel. 


173 


82 


255 


AF070066 


Mus 


Citron-K kinase 


1201 


98 






musculus 








256 


G02491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6572. 


460 


100 


257 


Z12841 


Orvctolagus 


Phospholipase 


368 


80 






cuniculus 








258 


Y95436 


Homo sapiens 


Human calcium channel SOC-3/CRAC-2. 


1857 


99 


259 


AJ222968 


Mus 


T ,-periaxin 




72 






musculus 








260 


AJ250839 


Homo sapiens 


•seri nf /threoninf* nrnt<»in kin?*^ 

■Jv 1 ' • it**/ 11 11 uuj 111 1*^ ynji\sin i\. iiiajt- 


861 


100 


261 


AJ249977 


Homo sapiens 


AlVTP-acti vat^H nrfttcin kinfi^^ Mmma ^ cnKiinit 
ivi'ii aviii vaiv^u jjiull<iii \iikuv/ g,tuiiinci J OUULlj 11 1 


7SR 

/JO 


9ft 

70 


262 


AF141386 


Rattus 


SLIT-2 


198 


40 






norvegicus 








263 


AF022S59 


Homo sapiens 


J 1>^U| \JyJl 1 111 i-1 uvj 


j j j 


Da 


264 


API 60477 


Homo «\?inif*n<; 


7 p- ctm^rfami l\/ r<" p ^ n tn r T 7\IT1? nrprnrcnr 

Juj .» V^l i U-1 1 1 1 1 y 1 \j C JJ IC i l_rlNJ.XV [Ji t^UI oUI 


U?7 


91 


265 


Y44662 


Homo sapiens 


Human 1497S O-nrotein ronnleri «*rf»nfnr 


636 


99 








(GPCR). 






966 
add 


T 797969 


Mus 


sodium glucose cotransporter 


— — 

204 


56 






IUUOvUIUj 








967 




numu bapiciib 


/■vjtxr u j r dse-dCLi vdting protein 01 1 z 


1 <o 

1 




268 


AF 127389 




pUldllVC IdMC lC-LCpiUJ I IV J 




jy 






norvegicus 








269 


X98296 


X \\J \l\\J jQUIvllo 


1 1 j 1 1 1 1 n hvHrn loco 

uuicjuiiiii 1 1 y ui u itibc 


9 1 ^ 
L 1 D 


7J 


270 






r t.-gaiTiilia icccpior 




0< 

-O 






pyogenes 








97 I 




J vlUUlIidJ Id 




j oy 


ZD 














979 




Mus 


\ / OQ 1 f\ ^nmoin rArpnfrkr nr/\f«in CADPC 

VI oiu aomain receptor protein oUKLo 


con 


y / 






111 lioCUJ US 








273 


L34938 


Rattus 


ii ji iuu ijpiu giuiciiiidic icccpior 




OO 






norvegicus 








274 


AL099724 


Hnmn ^p>nit»n^ 

J1UU1U DajJl til 0 


ujtijno.i.i ^iituiiaicr /^lurtjgcii-ucpcnocni 


i OO 










Exnressed Protein 1 TKF PI IT ATI VF nmfrein^ 












(isoform 1) 






275 


AF265555 


Homo sapiens 


ubiquitin-conjugating BIR-domain enzyme 


173 


94 








APOLLON 






276 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


148 


56 


277 


L403S0 


Homo sapiens 


thyroid receptor interactor 


430 


61 


278 ! 


AB046851 


Homo sapiens 


KJAA 1 63 1 protein 


283 


9fi 


279 


AC008075 


Arabidopsis 


Contains PFI00069 Eukarvotic orntein kinase 


1 57 


43 






thaliana 


domain. 






280 


M83738 


Homo sapiens 


|J1 VJLvlll Ijl vJlllw piiUjjjnulUOv 


J O i 




281 


AK024397 


Homo sapiens 


unnamed protein product 


439 


91 


282 


AF141326 


Homo sapiens 


RNA helicase HDB/D1CE1 


497 


84 


283 


AF156530 


Mus 


FTR-riomain tran^rrintional r^nrpcenr PF1 

J-» •*. u viv/iiicim Licui^viipuuilul IvplCdSUl Jt LI 


60S 

OUJ 


ICs 

i \j 






musculus 








284 


Y29336 


Homo sapiens 


Human secreted protein clone cs756 2 alternate 


647 


100 








reading frame protein. 






285 


Y73402 


Homo sapiens 


Human secreted protein clone yc25_l protein 


300 


90 








sequence SEQ ID NO: 26. 






286 


AF016411 


Homo sapiens 


KCNA3.1B 


137 


100 


287 


W89253 


Homo sapiens 


Human ALP. 


688 


97 


288 


AF1 12886 


Bos taurus 


differentiation enhancing factor 1 


750 


96 


289 


AF113131 


Homo sapiens 


host cell factor homolog LCP 


367 


44 


290 


U52111 


Homo sapiens 


plexin-related protein 


6< .8 


100 


291 


AF026504 


Rattus 


SPA-1 like protein p!294 


603 


89 
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SEQ 
ID 

NO: 


Accession 
No, 


Species 


Description 


Smith- 

VV aLC J lllall 

Score 


% 

THpntitv 
lut-iiu iy 






norvegicus 








292 


Ar 1 02854 


Rattus 
norvegicus 


membrane-associated guanylate kinase- 
interacting protein 2 Maguin-2 


124 


53 


293 


X992H 


Drosophila 
melanogaster 


ubiOjUitin-specific protease 


143 


38 


294 


Y 94943 


Homo sapiens 


Unman ct*r-**£*tf*r] r\twt'fMr> fifing YTt" 1 A. 1 r^r/Yfpil"> 

XiUiTian iCLretcu prt/iciii ctunc yn*+ 1 ^juji^ui 


185 


94 


295 


YV4o7W 


— : 

Homo sapiens 


Human nrntein rlnne HP02798 


108 


59 


296 


A.rUiy/o/ 


Homo sapiens 




154 


96 


297 


Y2c»3o5 


Homo sapiens 




568 


84 


298 


\JC\AC\A1 

Y 94^4 5 


Homo sapiens 


1 i U 1 1 1 cLi i b^^rcLtu pruLdii nunc* yii 1 ^ 1 (jiui^ui 

sequence SEQ ID NO:92. 


182 


97 


299 


B08906 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 10 dc^ hj i\\j*v>j. 


605 


69 


300 


R5oo9U 


— s 

Homo sapiens 


nUlliianOi CaUiiorin~rciaicu lituicuuic 


212 


97 


301 


AF022859 


Homo sapiens 


neuropilin-2(a0) 


277 


100 


302 


Y71 124 


Homo sapiens 


Human mitogenic regulator duox2. 


/ it) 


97 


303 


Y44297 


Homo sapiens 


Human receptor tyrosine kinase. 


228 


97 


304 


D32050 


Homo sapiens 


alanyl-tRNA synthetase 




R0 


305 


U43586 


Homo sapiens 


protein kinase related to Raf protein kinases; 
Method* conceptual translation supplied by 
author 




77 


306 


R:>4872 


Homo sapiens 


Human H13 viral receptor mutant 4. 


280 


95 


307 


D78572 


Mus 

musculus 


membrane glycoprotein 


199 


41 


308 


AF255614 


Rattus 
norvegicus 


scaffolding protein b.LirK 




88 


"309 


S 79463 


Mus sp. 


semaphorin homoIog=M-Sema F 


1 Oi. 


0" 


310 


AF17S941 


Homo sapiens 


ATP-binding cassette sub-family A member 2 


736 


100 


311 


U03413 


Dictyostelium 
discoideum 


calcium binding protein 


1 Jl 




312 


Y87347 


Homo sapiens 


Human signal peptide containing protein HSPP- 
124 SEQ ID NO:l24. 


744 
/ z f 4 4 




313 


Z97055 


Homo sapiens 


dJ388M5.4 (putative GS2 like protein) 


/ o / 




314 


AC004010 


Homo sapiens 


similar to Leucine-rich transmembrane proteins; 
44% similarity to U42767 (PID:gl736918) 


I Q7 


JO 


315 


AL021392 


Homo sapiens 


dJ439F8.2 (supported by (jjblNbOAIN and 


Z. /o 


JO 


316 


U70209 


Mus 

musculus 


polycystic kidney disease I protein 


I OJ 


38 


317 


AF 109643 


Rattus 
norvegicus 


coxsackie-adeno virus-receptor homo log 




38 


31S 


AF 104923 


Homo sapiens 


putative transcription factor 


138 


84 


319 


AF 100287 


Trypanosoma 
vivax 


activated protein kinase C receptor homolog 


141 


38 


320 


G0058S 


Homo sapiens 


riuman secretea protein, ocrv lis in\j. nooy. 


125 


51 


321 


Y21591 


Homo sapiens 


Human secreted protein (clone CC332-33). 


459 


97 


322 


D26070 


Homo sapiens 


numan type i mosnoi i.H,j-Lriapiiuipiia.ic 
receptor 


232 


97 


323 


Y2791b 


Homo sapiens 


Fiuman beer clcu pruicui giuajuC'U vy gciic inu. 

123. 


306 


88 


324 


ArUJU144 


Homo sapiens 


nAiifrtnoi tlir¥»a/l r\mt&tn A l"^i 7 — 7\J 

neiironcii tnrcau piuiciii /-\x>/ / c-i\ i i 


209 


70 


325 


Ml 9650 


Homo sapiens 


z ,j -cycnc-nucieouue j -pnubpnouicbLcrai>c 
3.1.4.37) 


214 


97 


326 


W80396 


Homo sapiens 


A. secretea protein encouea oy cione □po i *Q_i\j. 


1 40 


70 


327 


X75756 


Homo sapiens 


protein kinase C mu 


540 


78 


19R 
~y~o 


G02292 




Human secreted protein, SEQ ID NO: 6373. 


721 


99 


329 


AF 168990 


Homo sapiens 


putative GTP-binding protein 


877 


99 


330 


S67984 


Homo sapiens 


anti-HlV gpl20 antibody heavy chain variable 
region 


581 


80 


331 


X13916 


Homo sapiens 


LDL-receptor related precursor (AA -19 to 4525) 


2823 


98 


332 


YS7330 


Homo sapiens 


Human signal peptide containing protein HSPP- 
107 SEQ ID NO: 107. 


1127 


100 


333 


Y28503 


Homo sapiens 


.-IGFH3 Human Growth Factor Homologue 3. 


320 


9S 


334 


AC002563 


Homo sapiens 


putative RHO/RAC effector protein; 95% 


327 


93 
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SEQ 

in 

NO: 


Accession 
No. 


Species 


Description 


Smith- 

W aid mail 

Score 


% 

luciiuiy 








similarity to P49205 (PID.g 134 5860) 






335 


Y87347 


Homo sapiens 


Human signal peptide containing protein HSPP- 
124 SEQ ID NO: 124. 


1111 


67 


336 


AF006466 


Mus 

musculus 


iymphocvte specific form in related protein 


193 


75 


337 


AF265555 


Homo sapiens 


ubiouitin-coniueatine BIR-domain enzvme 

LJ* V| VA» VI 1 * VVilJUUUlillg A-JA1X VIVf liiulll wll£*Ylllw 

APOLLON 


632 


97 


338 


Y 13443 


Homo sapiens 


Amino acid sequence of hSlo3-2. 


516 


100 


339 


Y07637 


Homo sapiens 


putative GABA-gated chloride channel 


189 


100 


340 


Y05734 


Homo sapiens 


Human Grb7 effector 7 2412 nrntein 


2156 


99 


341 


AE000497 


Escherichia 
coli 


L-idonate transcriptional regulator 


928 


98 


342 


D90855 


Escherichia 
coli 


glycerol-3 -phosphate dehydrogenase (EC 

1 1 QQ chain A nnaprnhir 


769 


99 


343 


D85613 


Escherichia 
coli 


membrane component 


399 


100 


344 


M93239 


Escherichia 
coli 


transmembrane protein 


232 


100 


345 


M60177 


Escherichia 


enterobactin 


759 


99 


346 


D90699 


Escherichia 


Sensor protein copS (EC 2.7.3.-). 


638 


97 


347 


D90843 


Escherichia 


CapB protein. 


552 


100 


348 


Ml 3422 


Escherichia 
coli 


49 kd protein 


1193 


96 




i_> I Uj-O 


nsci ici lciiid 

coli 


similar to drug resistance translocases 






JJU 




Mus 

mi icpii 1 i ic 


enhancer-trap-locus-1 


JOVJ 


on 


351 


AF239613 


Homo sapiens 


apamin-sensitive small -conductance Ca2+- 


463 


SO 


352 


D90777 


X-<owIlCl IL-llld 

coli 


j-ii^ uroxy L»uiyry i-uoa ucnyurogendse 
dehydrogenase) (BhbD). 


oil 


i no 


353 


D90863 


Escherichia 
coli 


similar in 

j 11 1 1 1 1 til IU 


31 1 




354 


Y52386 


Homo sapiens 


Human transmembrane protein HP02000. 


133 


58 


355 


Y31645 


Homo sapiens 


Human tran snort- associated nrntein-7 fTRANP- 
7). 


482 


55 


356 


Y58637 


Homo sapiens 


l^rotciu regulating gene expression PRGE-30. 


1 19 


51 


357 


AF 119226 


Homo sapiens 


dual-SDeciflcitv tyrosine nhosnhatase YATH1 


1788 


100 


358 


Y87219 


Homo sapiens 


Human secreted protein sequence SEQ ID 
NO:258. 


165 


100 


359 


J00132 


Homo sapiens 


beta-fibrinogen 


233 


93 


360 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


128 


70 


361 


R28916 


Homo sapiens 


Type III procollagen (prior art). 


108 


40 


362 


U 16655 


Rattus 
norvegicus 


phospholipase C delta-4 


649 


65 


363 


G03119 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7200. 


95 


42 


364 


U47276 


Gallus gallus 


chicken brain factor-2 


104 


34 


365 


G037S9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


183 


65 


366 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8172. 


118 


46 


367 


X98258 


Homo sapiens 


M-phasc phosphoprotcin 9 


564 


75 


368 


AL021366 


Homo sapiens 


CICK0721Q.3 (Kinesin related protein) 


3387 


99 


369 


U70932 1 


Perftrttv^cn? 

-1 ^p-l wl 11 j 9vU3 

leucopus 




Q7 




370 


X86400 


Homo sapiens 


gamma subunit of sodium potassium ATPase 
like 


242 


73 


371 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7253. 


165 


56 


372 


U49974 


Homo sapiens 


manner transposase 


257 


55 


373 


X13916 


Homo sapiens 


LDL-receptor related precursor (AA -19 to 4525) 


21193 


99 


374 


AF234765 


Rattus 
norvegicus 


serine-arginine-rich splicing regulatory protein 
SRRP86 


1182 


78 


375 


U49974 


Homo sapiens 


mariner transposase 


172 


55 



112 
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LscsLii lpuon 


Smith- 


/O 


ID 


No. 






Waterman 


Tderuif v 


NO: 








Score 




376 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6065. 


221 


67 i 


377 


G00669 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4750. 


600 


100 


378 


X52574 


lvlus 


GTP bin diner nrotein 


1456 


91 






musculus 








379 


R69095 


Homo sapiens 


Anti-Hl V Fab tat3 1 light chain. 


68 


37 


380 


J04974 


Homo sapiens 


alpha-2 type XI collagen 


125 


37 


381 


AB002405 


Homo sapiens 


LAK-4p 


530 


43 


382 


I.J64830 


Dicty ostel i urn 


protein tyrosine kinase 


115 


44 






disco ideum 








383 


G02916 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6997. 


618 


98 


384 


GO 1194 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5275. 


617 


93 


385 


AJ245822 


Homo sapiens 


type F transmembrane receptor 


4560 


100 


3S6 


D86974 


Homo sapiens 


KIAA0220 


2148 


98 


387 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7284. 


142 


50 


388 


G 04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8153. 


99 


59 


389 


M12140 


Homo sapiens 


envelope protein 


197 


51 


390 




ii Willi/ jtl^JlC-Hif 




461 


77 


j/ 1 


Y42751 




Human ffilr'iiim hinrlino nrntPtn 7 /'/~ , yiRP-7 i 
xiuiiiuii uaiciuiii uiiivjixig jjiulciii — ^v^ci v*± — /• 


181 


94 


1Q7 
_jyz 


W tOJ J 1 


rxuniu i>dpiClI£> 


T4iim!)n hr^Qct panwr r^lcat^H nrrttpin R.f*'l?Ti7 
niilllull Ul CdSL Uall^CI ICldlCU piUlClil DL/IVDii. 


241 


66 




V 1 AAA7 


nomo Sapiens 


oiiacicny reoepior protein 








WfiS£H7 
W o jO\j I 


Homo sapiens 


Secreted protein clone da228 6. 


CK7 

yj 1 




J7J 


Y76H7 
I /DjjZ 


Homo sapiens 


rTaglllCIH Ui IlUIIIali bCL-ICLCU piUlClil CLIOUUCU Uy 


1 71 
1/1 










gene 38. 








UUJ7JU 


— — : 

Homo sapiens 


riuman secreceo proiein, oxiv^ llj inu. ouii. 


7^n 


1 UU 


J7/ 


ADuJ Az7\jH 


T-T% /I /"i 1-k ■n4-/*r» 

XiyiCJOaLcS 


dopamine receptor D4 


MS 
l\JD 








Ox f n A a ig 

ij- nciaciyjui 








■3QO 


A TOO.'7*7QQ 


Homo sapiens 


siromai anngen j, ^oi 


£A1 


RS 

OJ 


TOO 




Homo sapiens 


riuman secreieu proxem sequence encouea oy 


1 HA7 


07 

yz 








gene z bh/^/ 1JL> rsU. 12o. 






AOA 
*fUU 


Y zyoo 1 


-r- : 

Homo sapiens 


Human secreted protein clone cb98 4. 




j I 


4UI 


Uo /UUZ 


Homo sapiens 


similar to rat integral membrane glycoprotein; 




7<J 














ah7 


API nn7SA 


noinu sapiens 


dnciciiL UDitjuiLOUis piULcin Aun lboionn 




7 J 






vjrailUS ticiliub 


aipna-z-macrogioouun rcccpLor 


7<:r 

-JO 


ou 


AC\A 

4U4 


ArU O40Z 


Mus 


ADP-ribosylation factor-directed GTPase 


D4D 


oy 






¥Y11 IC/*1 1 ll 1C 

rnusouius 


duLivdiing proiein lsoiorm d 






4UD 


XyZoo / 


Human 


pol/env 


1 A7 
10Z 








CllUUgClJUUb 












retrovirus JC 








nUO 


v*ni fV? 

X jU 1 oz 


rauiiiu sapiens 


Human dorsal root receptor 4 hX)KJ\.4. 




77 
/ Z 


4U / 


AivUZZOZO 


Homo sapiens 


unnamed protein product 




OQ 


4115 


T 1 '3QfJ9 
L,1.3oVZ 


Homo sapiens 


ribosmal protein small subunit 




Q7 

yz 


409 


Y91600 


Homo sapiens 


Human secreted protein sequence encoded by 


1788 








_ . 


gene y oty iu iskj.z / j. 






a i n 


Woo / HO 


Homo sapiens 


Secreted protein encoded by gene 30 clone 


7 AHA 


yy 






__ 


HTSFV09 






41 1 




IvlUS 




?67R 


R7 






mn^ciiln^ 

11 IUjvUIUJ 








412 


Y86233 


Homo ^anipns 


Human <;errprpf1 nrntein FTMTMX29 SF.O ID 


1014 


92 








MO: 148. 






413 


1 I10S42 


Pan 


1Y1.JL 1V_- LlddO i f\ 


265 


71 






troglodytes 








414 


AF 155097 


Homo ^anif*n9 


NY-RFN-7 antieen 

IN I i\J_/J N / £UJL1£,UJ1 


850 


95 


415 


G03203 


llvFltlU auUlwllO 


Human secreted nrotein SFO TD "NO* 72S4 

llUlllcLll j^vl WlwU wlV^l^llt, kJUW A IN w» / ^C^T. 


88 


48 


A1 A 
4 lO 


VS7Q1 1 
Y D /yi 1 


Homo sapiens 


Human transmembrane protein HTMPN-35. 


zoo 


so 


417 


W27651 


Homo ^amf*n<; 
a-lviiiivs oaiJi^iiD 


^pprptpH r>rr»tf*in AT70^ 


481 


60 


418 


Y76884 


Homo sapiens 


Retinoblastoma binding protein-7sequence. 


3077 


87 


419 


AF255559 


Notothenia 


alpha tubulin 


289 


68 






coriiceps 








420 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6065. 


209 


74 


421 


AL 109827 


Homo sapiens 


dJ309K20.2 (acrosomal protein ACR55 (similar 


1446 


96 








to rat sperm antigen 4 (SPAG4))) 






422 


AC008075 


Arabidopsis 


F24I5.4 


112 


35 






thaliana 









113 
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SEQ 

ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


423 


AF231705 


Homo sapiens 


Alu co-repressor 1 


1090 


100 


424 


AF234S87 


Homo sapiens 


FLAMINGO 1 


6268 


97 


425 


Y35942 


Homo sapiens 


Extended human secreted protein sequence, SEQ 
ID NO. 191. 


1961 


99 


426 


AB009288 


Homo sapiens 


N-copine 


635 


9S 


427 


LI 23 92 


Homo sapiens 


Huntington's Disease protein 


16080 


99 


428 


Y94990 


Homo sapiens 


Human secreted protein vb21_l, SEQ ID NO:20. 


768 


98 


429 


AJ293573 


Homo sapiens 


zinc finger protein Cezanne 


542 


87 


430 


Y84441 


Homo sapiens 


Amino acid sequence of a human RNA- 
associated protein. 


2074 


100 


431 


G02850 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6931. 


723 


95 


432 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8148. 


73 


42 


433 


AF 159296 


Lycopersicon 
esculentum 


extensin-like protein 


613 


4$ 


434 


W4835J 


Homo sapiens 


Human breast cancer related protein BCRB2. 


135 


44 


435 


X73874 


Homo sapiens 


phosphorylase kinase 


3442 


97 


436 


AF 161 426 


Homo sapiens 


HSPC308 


268 


74 


437 


Y30812 


Homo sapiens 


Human secreted protein encoded from gene 2. 


1055 


52 


438 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7879. 


168 


56 


439 


XI 4766 


Homo sapiens 


GABA-A receptor alpha 1 subunit 


2294 


96 


440 


X02344 


Homo sapiens 


beta-tubuiin 


311 


95 


441 


AF168418 


Homo sapiens 


activating signal cointegrator 1 


1882 


100 


442 


LI 1672 


Homo sapiens 


zinc finger protein 


795 


54 


443 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7284. 


93 


26 


444 


A52140 


unidentified 


HUMAN NDR 


2451 


100 


445 


X9S330 


Homo sapiens 


ryanodine receptor 2 


9356 


99 


446 


AF1I6712 


Homo sapiens 


PR02738 


227 


49 


447 


AF245447 


Homo sapiens 


sphingosine kinase ty pe 2 isoform 


576 


99 


448 


AF1330S6 


Homo sapiens 


membrane-type serine protease 1 


2630 


94 


449 


U87305 


Rattus 
norvegicus 


transmembrane receptor UNC5H1 


817 


93 


450 


AF08I249 


Homo sapiens 


JAW 1 -related protein MR VI I A long isoform 


4568 


99 


451 


AC005498 


Homo sapiens 


R31665 1 


316 


62 


452 


M60235 


Homo sapiens 


granule membrane protein- 140 


464 


73 


453 


AB036706 


Homo sapiens 


intelectin 


730 


88 


454 


G00918 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4999. 


263 


SI 


455 


Y22634 


Homo sapiens 


Human cytokine inducible regulatory protein- 1 


192 


67 


456 


Y36705 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 62. 


106 


40 


457 


N91325 


Homo sapiens 


DNA encoding human growth hormone receptor. 


3282 


96 


458 


M19155 


Plasmodium 
falciparum 


S-antigen precursor 


1 10 


36 


459 


Y13377 


Homo sapiens 


Amino acid sequence of protein PR0257. 


509 


98 


460 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


149 


43 


461 


Y 14482 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 17. 


184 


54 


462 


Y53005 


Homo sapiens 


Human secreted protein clone pm749_8 protein 
sequence SEQ ID NO: 16. 


135 


47 


463 


X84960 


Triticum 
aestivum 


low molecular weight glutenin 


109 


33 


464 


W19919 


Homo sapiens 


Human Ksr-1 (kinase suppressor of Ras). 


1781 


85 


465 


AF 189764 


Mus 

musculus 


alpha/beta hydrolase- 1 


502 


59 


466 


U93569 


Homo sapiens 


p40 


101 


30 


467 


Y41528 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 77. 


1172 


99 


468 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


149 


52 


469 


AJ000008 


Homo sapiens 


PI3-kinase 


5832 


97 


470 


X70922 


Mus 

musculus 


neurotoxin homologue 


118 


47 


471 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7878. 


198 


75 


472 


Y36705 


Homo sapiens 


Fragment of human secreted protein encoded by 


72 


17 



114 



3DOCID: <WO 01S7168A2_L> 



WO 01/57188 



PCT/U SO 1/03800 



ID 

"MO- 


No. 


O Mvvl wJ 


Description 


Smith- 
Waterman 
Score 


% 

Identity 








gene 62. 






473 


G02313 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6394. 


328 


100 


4.74. 


Y07007 


Homo sapiens 


Breast cancer associated antigen precursor 
sequence. 


1013 


97 


475 


W93254 


Homo sapiens 


Human ESRPi protein. 


943 


80 


*+ /D 


W48351 


Homo sapiens 


Human breast cancer related protein BCRB2. 


236 


65 


477 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


202 


60 


4.7 R 


GO 1870 

V_J \J L %J / \J 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5951. 


267 


100 


470 


AF 102777 


Mus 

musculus 


FYVE finger-containing phosphoinositide kinase 


3427 


92 


48fl 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7133. 


123 


53 


4£1 


W87701 


Homo sapiens 


A human membrane fusion protein designated 
SYTAX1. 


221 


77 


482 


G03119 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7200. 


131 


39 


4R^ 


AF210651 


Homo sapiens 


NAG18 T 


124 


59 


484 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-N r l P 


343 t 


50 


4RS 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4718. 


129 


70 


486 


U15174 


Homo sapiens 


BCL2/adenovirus E1B 19kD-interacting protein 
3 


149 


73 


487 


Y76167 


Homo sapiens 


Human secreted protein encoded by gene 44. 


627 


100 






Homo sapiens 


stabilin-1 


1244 


91 


4RQ 


VJ UJ :70 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7879. 


313 


65 






Homo sapiens 


Huntington's Disease protein 


16081 


100 


491 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


197 


66 








laminin-binding protein 


228 


70 


49d 


I tl CI "7 A 

U ID 1 


Hr»mrv c q r> } f* rt *\ 
iJUl 1 iC jajjiuij 


BCE2/adenovirus E1B 19kD-interacting protein 
3 


128 


41 




y uzoy j 




Human secreted protein encoded by gene 44 
clone HTDAD22. 


197 


67 


40 *\ 




Homo sapiens 


R31449 3 


889 


94 


4S*0 




Homo saDiens 


Human secreted protein, SEQ ID NO: 7867. 


229 


61 


/ 




Canis 
familiaris 


D4 dopamine receptor 


90 


48 


40R 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


228 


65 


490 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


213 


52 




1 148508 


Homo sapiens 


skeletal muscle ryanodine receptor 


26406 


99 


501 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7452. 


105 


58 


JUZ 


AF1 


Homo sapiens 


PRO 1722 


156 


62 


503 


AF1 13685 


Homo sapiens 


PRO0974 


116 


50 


DU4 


T 17Q4^R 


Knmn ^aniens 


WW domain binding protein-2 


322 


59 


DUD 


W00AS1 
W-zOJ 1 


T-lf>mn ^aniens 

X ±\JmI }\J iJtA^J 1 wild 


Human secreted protein CD124_3. 


608 


55 


DUO 




Hnmo saDiens 


Secreted protein encoded by clone dhl 135_9. 


986 


70 


J\J / 


Y 862 65 


Homo sapiens 


Human secreted protein HUSXE77, SEQ JD 
NO:180. 


115 


33 


508 


AL160175 


Homo sapiens 


DA243J16.3 (similar to MYLK (myosin, light 
polypeptide kinase)) 


184 


92 


509 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


97 


62 


•sin 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


117 


63 


D J 1 


W79092 


Homo sapiens 


Human secreted protein dn740_3. 


1058 


100 


J 1 £. 


AFO10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


205 


64 




All ^439 

/A. J 1 JJHJ7 


Homo sapiens 


GR1P1 protein 


2151 


100 


514 


AE003456 


Drosophila 
melanogaster 


CG6393 gene product 


259 


42 . 


515 


Z17206 


Xenopus 
laevis 


P 46XlEg22 


128 


4U 


516 


AF104413 


Homo sapiens 


large tumor suppressor 1 


1766 


94 


517 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7878. 


92 


40 


518 


AF151083 


Homo sapiens 


HSPC249 


444 


98 


519 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


318 


50 


520 


X92485 


Plasmodium 
vivax 


pval 


170 


61 



115 



sJSDOCID <WO 0157188A2_I_> 



WO 01/57188 




T/US01/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


521 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7871. 


159 


59 


522 


AF121857 


Homo sapiens 


sorting nexin 7 


259 


40 


523 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6735. 


82 


37 


524 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 clone 
HPMBQ32. 


253 


73 


525 


AF 119851 


Homo sapiens 


PRO 1722 


162 


57 


526 


Y27761 


Homo sapiens 


Human secreted protein encoded by gene No. 47. 


154 


57 


527 


G02707 


Homo sapiens 


Human secreted protein,. SEQ ID NO: 6788. 


70 


45 


528 


U47924 


Homo sapiens 


C8 


1112 


86 


529 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8144. 


84 


45 


530 


G03203 


Homo sapiens 


Human secreted protein. SEQ ID NO: 7284. 


111 


60 


531 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: S14S. 


92 


65 


532 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7348. 


75 


29 


533 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7284. 


182 


48 


534 


AF068286 


Homo sapiens 


HDCMD38P 


861 


100 


535 


U07707 


Homo sapiens 


epidermal growth factor receptor substrate 


228 


60 


536 


GO 1955 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6036. 


484 


75 


537 


AF219232 


GaJlus gallus 


qin-induced kinase 


206 


53 


538 


AF 135022 


Homo sapiens 


mediator 


128 


100 


539 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7348. 


141 


59 


540 


AFO 16430 


Cacnorhabditi 
s elegans 


contains similarity to a BR-C/TTK. domain 


853 


39 


541 


AC003093 


Homo sapiens 


OXYSTEROL-BINDING PROTEFN; 45% 
similarity to P22059 (PID:gl 29308) 


408 


66 


542 


M29487 


Homo sapiens 


integrin alpha subunit precursor 


517 


81 


543 


AF 102530 


Mus 

musculus 


olfactory receptor F3 


327 


73 


544 


Y7343 1 


Homo sapiens 


Human secreted protein clone ybl86 1 protein 
sequence SEQ ID NO :84. 


386 


100 


545 


AE004833 


Pseudomonas 
aeruginosa 


probable TonB-dcpendcnt receptor 


279 


42 


546 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7874. 


264 


53 


547 


Y69192 


Homo sapiens 


A human monocyte-macrophage apolipoprotein 
B receptor protein. 


1772 


67 


548 


Y91493 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 43 SEQ ID NO: 166. 


176 


100 


549 


GO 1 571 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5652. 


777 


99 


550 


AF0445S8 


Homo sapiens 


protein regulating cytokinesis 1: PRC1 


1953 


88 


551 


Y29332 


Homo sapiens 


Human secreted protein clone pe584_2 protein 
sequence. 


1224 


94 


552 


X98330 


Homo sapiens 


ryanodine receptor 2 


24621 


99 


553 


Y42782 


Homo sapiens 


Human UC Band #331 protein. 


684 


95 


554 


AB025258 


Mus 

musculus 


granuphilin-a 


501 | 


41 


555 


AJ010346 


Homo sapiens 


RJNG-H2 


1468 


100 


556 


W92388 


Homo sapiens 


Human TR-interacting protein S239a. 


538 


92 


557 


AF1 19851 


Homo sapiens 


PRO 1722 


175 


59 


558 


AF 11 7756 


Humo sapiens 


thyroid hormone receptor-associated protein 
complex component TRAP 150 


183 


32 


559 


G02872 


Homo sapiens 


Human secreted protein. SEQ ID NO: 6953. 


319 


6S 


560 


D86214 


Mus 

musculus 


Ca2-r dependent activator protein for secretion 


1010 


93 


561 


AF 187325 


Canis 
famiJiaris 


melanoma antigen 


287 


55 


562 


AJ001981 


Homo sapiens 


OXA1L 


2512 


99 


563 


Z17238 


Ratlus 
norvegicus 


glutamate receptor subtype delta- 1 


338 


66 


564 


W30638 


Homo sapiens 


Partial human 7-transmembrane receptor 
HAPO 167 protein. 


371 


100 


565 


AC005620 


Homo sapiens 


R33590_l 


467 


97 


566 


Y9935S 


Homo sapiens 


Human PRO 1772 (UNQ834) amino acid 
sequence SEQ IDNO:63. 


1138 


78 


567 


AL031177 


Homo sapiens 


dJ8S9M15.3 (novel protein) 


1002 


58 


568 


AF151043 


Homo sapiens 


HSPC209 798 


100 



116 



WO 01/57188 ^ ^ PCT/US01/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


569 


Ah 09751 a 


: 

Homo sapiens 


1 i \/#*r , _cr%f»r>i'Fi r* ft* anennrtpr 


231 


100 


570 


AB035698 


Homo sapiens 


1VL lis snap en/ INlrv-rcialou Kinase- iviai^i-v x 


1532 


100 


571 


Y07096 


Homo sapiens 


V»>010n Cancer abbUClaUSU a_uii£,^ii piv^vLnowj 


1064 


100 ' 


572 


AL03U77 


Homo sapiens 


dJ889M15.3 (novel protein) 


735 


55 


573 


Y66639 


Homo sapiens 


iviemDr<ine"Duunu piotcui ri\u-7v. 


254 


45 


574 


AB037108 


Homo sapiens 


&.-\7 **.r\ Tr*»nein^mkran/» finmfltn nmh^n rc^CCT^IOr 

seven inirisiiici iiui aiic uvjuioiu ui^jntui 


1883 


99 


575 


D43949 


Homo sapiens 


TV-tic rr*»r»o i c r»rvi/#»l 

l nis tjcue is novci. 


836 


100 


576 


Y48596 


Homo sapiens 


Human breast tumour-associated protein 57. 


108 


50 


577 


G00352 


Homo sapiens 


Human secreieu protein, oi-*v/ llj inu, ^jj- 


141 


75 


578 


R95913 


Flomo sapiens 


Neural thread protein. 


140 


65 


579 


AK025116 


Homo sapiens 


unnamed protein product 


201 


70 


580 


Y86473 


Homo sapiens 


Human gene Dz-encooea proiein irdgrueui, oj^v 
ID NO:3o&. 


77 


70 


581 


AF196779 


Homo sapiens 


JM10 protein 


450 


100 


582 


AF 1 88706 


Flomo sapiens 


g20 protein 


330 


98 


583 


AB030234 


Canis 
familiaris 


D4 dopamine receptor 


64 


56 


584 


G02621 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6702. 


345 


90 


585 


AL096828 


Homo sapiens 


dJ963E22.1 (Novel protein similar to in i -jsx-in-z 
Antigen) 


268 


85 


586 


Y30819 


Homo sapiens 


Human secreted protein encoded from gene 9. 


235 


35 


587 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4438. 


132 


56 


588 


G02872 


Homo sapiens 


Human secreted protein. SEQ ID NO: 69D3. 


1 R'} 


79 


589 


AF235017 


Mus 

musculus 


2P1 protein 

. — 


764 


80 


590 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 clone 
HPMBQ32. 




O 1 


591 


Y30709 


Flomo sapiens 


Amino acid sequence of a human secreted 
protein. 


110 


43 


592 


Y53875 


Homo sapiens 


A human seven transmembrane signal transducer 
polypeptide. 


1369 


92 


593 


Y53051 


Homo sapiens 


Human secreted protein clone ddl 19_4 protein 
sequence H-J inu.iuo. 


1112 


97 


594 


Y27658 


Homo sapiens 


Human secreted protein encoded by gene No. 92, 


763 


79 


595 


G03798 


Flomo sapiens 


Human secreted protein, bi^j iu invj. to/?. 


156 


58 


596 


AFI51110 


Mus 

musculus 


COP1 protein 


2215 


95 


597 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7867. 


157 


65 


598 


AF 192499 


Mus 

musculus 


putative secreted protein ZSIG37 


143 


40 


599 


AF1 19855 


Homo sapiens 


rt\XJ 1 o<+ / 


236 


76 


600 


G02872 


Homo sapiens 


riuman secreicu proicjrj, iu iwj. uy~j~>. 


212 


73 


601 


Y00295 


Homo sapiens 


Human secreieu proicm cu^uueu uy g,cii^ jo. 


567 


88 


602 


AF 184971 


Homo sapiens 


class II cytokine receptor ZCYTOR7 


2015 


74 


603 


AF061936 


Flomo sapiens 


diacylglycerol kinase iota 


773 


96 


604 


AL096828 


Homo sapiens 


dJ963E22.1 (Novel protein similar to NY-REN-2 
Antigen) 


1333 


93 


605 


AB033106 


Homo sapiens 


rviAAizou proiein 


3915 


100 


606 


X75756 


Homo sapiens 


protein kinase C mu 


3916 


99 


607 


D86983 


Homo sapiens 


similar to D.melanogaster peroxidasin(U 11052) 


5758 


99 | 


608 


W69341 


Homo sapiens 


Secreted protein oi clone v^kjZ fy l . 


1 S77 


99 


609 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 clone 
HPMBQ32. 


339 


82 


610 


Y27868 


Homo sapiens 


Human secreted protein encoded by gene No. 
107. 


1 1 6 


62 


611 


AF202636 


Homo sapiens 


angiopoietin-like protein PP1158 


2164 


100 


612 


AF090944 


Homo sapiens 


PRO0663 


218 


82 


613 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


195 


59 


614 


M87053 


Ratlus 
norvegicus 


lens membrane protein 


450 


84 


615 


AC004232 


Homo sapiens 


FPM315 


163 


37 


616 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6065. 


205 


79 



117 



MSDOCID: <WO 0157188A2_L> 



WO 01/57188 




CT/USO 1/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

J rfentitv 

XUUIJU IJr 


617 


Y91524 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 74 SEQ ID NO: 197. 


821 


99 ~ 


618 


AJ245621 


Homo sapiens 


CTL2 protein 


2258 


99 


619 


Y76198 


Homo sapiens 


Human secreted protein encoded by gene 75. 


108 


64 


620 


AF067S64 


Homo sapiens 


transferrin receptor 2 alpha 


3922 


94 


621 


D90721 


Escherichia 
coli 


Transmembrane protein dppC 


573 


90 


622 


W75858 


Homo sapiens 


Human secretory protein of clone CS752-3. 


710 


100 


623 


Y949S2 


Homo sapiens 


Human secreted protein vbl2 1, SEQ ID NO:4. 


733 


100 


624 


AF034745 


Mus 

musculus 


LNXp80 


617 


83 


625 


U42580 


Paramecium 
bursaria 
Chlorella 
virus 1 


Pro-rich, IPPPNMSLPLS (3x) 


94 


46 


626 


U79260 


Homo sapiens 


unknown 


194 


70 


627 


R95913 


Homo sapiens 


Neural thread protein. 


99 


50 


628 


G03450 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7531. 


427 


100 


629 


Y36281 


Homo sapiens 


Human secreted protein encoded by gene 58. 


590 


100 


630 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD'>2 


165 


76 


631 


G02139 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6220. 


268 




632 


U 3 6996 


Homo sapiens 


protein tyrosine posphata.se 


351 


Ov 


633 


AF12IS57 


Homo sapiens 


sorting nexin 7 


2019 


100 


634 


AF28377^> 


11U1HU odpiwllo 


Qimiliir tn T-T i \ t~y> cunipnv pi Kncritn q 1 nrAtoin I 1 Pi 
oUlllldl iu jnuiUU iajjlClli I IUUdUIIIciI prULCIu L-lvJ 

encoded by GenBank Accession Number 
L25899 




"7"7 


635 


Y07090 


Homo sapiens 


Renal cancer a^ociated antioen nreoiir<;nr 

sequence. 


077 




636 


ABO 13382 


l~Tr\mn ^anien^ 

.1. i\_Ml iVJ OtiJJ 1 VI J o 


DUSP6 


A\ A 
*r I *t 


/o 


637 


G02S72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


315 


71 


638 


M 95 762 


R afftic 

norvegicus 


rj4RA tmncnnrfpr 
vjrVDrV ii «uloUUI LCI 




CO 


639 


G03789 


Homo sapiens 


Human secreted protein. SEQ ID NO: 7870. 


219 


60 


640 


Y01400 


Hnnin ^aniens 


HNHF029. 


1 17 


70 


641 


AC008075 


Arabidopsis 
thai i ana 


F24J5.4 


121 




642 


W74824 


Homo sapiens 


Human secreted protein encoded by gene 96 
clone HAQBK61. 


615 


62 


643 


ABO 15982 


Homo sapiens 


serine/threonine kinase 


485 


98 


644 


Y25806 


Homo sapiens 


Human secreted protein fragment encoded from 
gene 23. 


162 


46 


645 


AF 122904 


Homo sapiens 


membrane protein DAP 10 


474 


100 


646 


AF233323 


Homo sapiens 


Fas-associated phosphatase- 1 


200 


38 


647 


W48804 


Homo sapiens 


Homo sapiens clone BK158 1 protein. 


1203 


99 


648 


AF257330 


Homo sapiens 


COBW-Iike protein 


1440 


98 


649 


Y36203 


Homo sapiens 


Human secreted protein #75. 


233 


73 


650 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


173 


78 


651 


Y32199 


Homo sapiens 


Human recentor mnlecnlf* fRFf~^ pnrnHpH hv 

Incyte clone 2022379. 


1012 

JUJZ 


1 Oft 


652 


AB032909 


Hylobates 
agilis 


dopamine receptor D4 


122 


32 


653 


AK021848 


Homo sapiens 


unnamed protein product 






654 


W73411 


Homo sapiens 


Human secreted nrotein encoded hv C^pne Nn 
15. 


57 


37 


655 


L22455 


Rattus 
norvegicus 


mu opioid receptor 


116 


34 


656 


G03112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7193. 


no 


45 


657 


G02345 


Homo sapiens 


Human secreted protein. SEQ ID NO: 6426. 


459 


97 


658 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 clone 
HPMBQ32. 


291 


75 


659 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6913. 


134 


65 


660 


Y91423 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 11 SEQ ID NO: 144. 


333 


96 



118 



3DOCID: <WO 0157188A2_I_> 



WO 01/57188 M W PCT/US01/03800 



SEQ 

ID 

NO: 


Accession 
"No. 


Species 


Description 


Smith- 

Waterman 

YY tXvwl 1 1 mil 

Score 


% 

Identity 


661 


G03789 


Homo sapiens 


Human secreted proiem, iu inw. /o/u. 


168 


68 


662 


Y53886 


Homo sapiens 


A suppressor of cytokine signalling protein 
designated HSCOP-6. 


375 


43 


663 


W75771 


Homo sapiens 


riuman oil oinoing pioiein /vri^uo. 


629 


100 


664 


AL096770 


Homo sapiens 


bA150A6.2 (novel 7 transmembrane receptor 
(rnouopsin larouyj ipnaciory reccpiui hk-c; 
protein (hs6Ml-21)) 


480 


55 


665 


AB037734 


Homo sapiens 


K1AA 1 3 1 3 protein 


978 


96 


666 


W82841 


Homo sapiens 


Human cerebral protein- 1. 


192 


84 


667 


W8^841 


Homo sapiens 


Human cerebral protein- 1. 


182 


87 


668 


AB030184 


Mus 

musculus 


contains transmembrane (TM) region and ATP 
binding region 


757 


68 


669 


AB032919 


Hylobates 
muelleri 


dopamine receptor D4 


85 


37 


670 


AF 107295 


Rattus 
norvegicus 


outer membrane protein 


746 


81 


671 


Z33642 


Homo sapiens 


leukocyte surface protein 


394 


93 


672 


W85608 


Homo sapiens 


Secreted protein clone du410_5. 


261 


91 


673 


G03203 


Homo sapiens 


Human secreted protein, onv i N v^« /— o^- 


106 


48 


674 


AL035587 


Homo sapiens 


dJ475N16.4 (K1AA0240) 


2388 


99 


675 


Y59668 


Homo sapiens 


Secreted protein lU8-UCl>3-(J-d-ri^. 


1 1 34 


53 


676 


G03797 


Homo sapiens 


Human secreted protein, ofcV 1JJ inu: /o/o. 


174 


74 


677 


AF026954 


Bos taurus 


pyruvate dehydrogenase phosphatase regulatory 
subunit precursor; PDPr 


ion 


95 


678 


LI 1625 


Mus 

musculus 


receptor protein-tyrosine kinase 




96 


679 


AL031427 


Homo sapiens 


d J 1 67A 1 9.3 (novel protein) 




100 


680 


AJ133430 


Mus 

musculus 


olfactory receptor 


OR 


77 


681 


G02532 


Homo sapiens 


Human secreted proiem, ^nv^ iu jnu. ddij. 


179 


70 


682 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


336 


76 


683 


Y94943 


Homo sapiens 


Human secreted protein clone ytl4_l protein 
sequence SEQ ID NO:92. 


l 1 o 


100 


684 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


100 


37 


685 


G00885 


Homo sapiens 


Human secreted protein, bbv^ iu invj. -Hyoo. 


162 


60 


686 


AK001518 


Homo sapiens 


unnamed protein product 


590 


100 


687 


GO 1982 


Homo sapiens 


Human secreted protem, SEQ IDNU: oUoJ. 


71 S 


100 


688 


Y92241 


Homo sapiens 


Human cancer associated antigen precursor 
(MO-REN -46). 




99 


689 


AC024792 


Caenorhabditi 
s elegans 


contains similarity to lR.P/oJlo 


423 


36 


690 


Y27868 


Homo sapiens 


rliiman sccreieu protein cijccnjca vy gcjic jnu. 
107. 


183 


81 


691 


Y56514 


Homo sapiens 


Human JurKat ceu cione rz-i j /\iiyiiu luntcbi 
ORF protein sequence. 


180 


88 


692 


Y27795 


Homo sapiens 


riuman secreieu proiciii ciil-uucu vy gcuc nu. > >> 


1539 


"99 


693 


Y3626S 


Homo sapiens 


Human secreted protein encoded by gene 45. 


428 


9S 


694 


U 12465 


Homo sapiens 


ribosomal protein L35 


308 


89 ! 


695 


Y45272 


Homo sapiens 


Human secreted protein encoded from gene 16. 


1517 


99 


696 


AF191838 


Homo sapiens 


TANK binding kinase ltSKi 


1242 


98 


697 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone H T DAD22. 


275 


75 


698 


Y87280 


Homo sapiens 


Human signal peptide containing protein HSPP- 
.57 SEQ ID NO:57. 


576 


90 


699 


Y97999 


Homo sapiens 


Human SCAD lamily molecule nirM-i, 
ID NO: 1 - 


77Q 


99 


700 


AJ006701 


Homo sapiens 


putative serine/threonine protein kinase 


610 


79 


701 


AF209198 


Homo sapiens 


zinc finger protein 277 


2357 


100 


702 


AJ298841 


Mus 

musculus 


torsinA protein 


709 


45 


703 


AK021729 


Homo sapiens 


unnamed protein product 


622 


98 


704 


Z46787 


Caenorhabditi 
s elegans 


similar to Glutaredoxin, Zinc finger, C3HC4 
type (RING finger) 


920 


51 


705 


G02882 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6963. 


589 


98 



119 



vJSDOCID: <WO 0157188A2J_> 



WO 01/57188 




T/US01/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 

Waierman 

Score 


% 

Identity 


706 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6582. 


125 


58 


707 


R95326 


Homo sapiens 


Tumor necrosis factor receptor 1 death domain 
ligand (clone 2DD). 


121 


95 


708 


G03002 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7083. 


125 


39 


709 


Y96202 


Homo sapiens 


IkappaB kinase (IKK) binding protein, Y2H56. 


516 


98 


710 


M63577 


Saccharomyc 
cs cercvisiac 


SFP1 


131 


59 


711 


AB026291 


Rattus 
norvegicus 


acetoaceryl-CoA synthetase 


467 


85 


712 


D21211 


Homo sapiens 


protein tyrosine phosphatase (PTP-BAS, type 3) 


368 


44 


713 


AF044033 


Marmota 
marmota 


olfactory receptor 


615 


83 | 


714 


G03561 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7642. 


251 


100 


715 


AB033062 


Homo sapiens 


KIAA1 23 6 protein 


1380 


100 


716 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4658. 


80 


73 


717 


Y96S64 


Homo sapiens 


SEQ. ID. 37 from WO0034474. 


835 


99 


718 


AJ243396 


Homo sapiens 


voltage-gated sodium channel beta-3 subunit 


234 


100 


719 


U47334 


Homo sapiens 


similar to chicken gamma aminobutyric acid 
receptor beta4 subunit 


578 


99 


720 


AB020598 


Homo sapiens 


peptide transporter 3 


1096 


100 


721 


Y53886 


Homo sapiens 


A suppressor of cytokine signalling protein 
designated HSCOP-6. 


570 


74 


722 


J05046 


Homo sapiens 


insulin receptor-related receptor 


6787 


100 


723 


AF00195S 


Ambystoma 
tigrinum 


electrogenic Na+ bicarbonate cotransporter; 
NBC 


111 


41 


724 


AF 127084 


Mus 

musculus 


semaphorin cytoplasmic domain-associated 
protein 3 A 


5253 


94 


725 


X54673 


Homo sapiens 


GABA transporter 


3114 


99 


726 


AF016191 


Rattus 
norvegicus 


potassium channel 


370 


100 


727 


AB029559 


Rattus 
norvegicus 


BAT I 


139 


35 


728 


Y28503 


Homo sapiens 


HGFH3 Human Growth Factor Homologue 3. 


2186 


97 


729 


AJ011415 


Homo sapiens 


plexin-Bl/SEP receptor 


729 


56 


730 


Z93096 


Homo sapiens 


bK390B3.1 (manic fringe (Drosophila) 
homolog) 


142 


68 


731 


Z10062 


Homo sapiens 


cDNA encoding a human vanilloid receptor 
homologue Vanilrepl . 


675 


99 


732 


AF161382 


Homo sapiens 


HSPC264 


492 


94 


733 


AB029033 


Homo sapiens 


K1AA1 1 10 protein 


3826 


99 


734 


AE000493 


Escherichia 
coli 


putative transport protein 


592 


97 


735 


AL033379 


Homo sapiens 


dJ41 7022.2 (novel 7 transmembrane receptor 
(rhodopsin family) protein similar to high- 
affinity lysophosphatidic acid receptor homolog) 


2173 


99 


736 


AF 132599 


Homo sapiens 


RANTES factor of late activated T ly mphocytes- 
1 


245 


56 


737 


X55019 


Homo sapiens 


acetylcholine receptor delta subunit 


883 


99 


738 


X91906 


Homo sapiens 


voltage-gated chloride ion channel 


1978 


100 


739 


AB026116 


Homo sapiens 


organic anion transporter 4 


1444 


98 


740 


D00570 


Mus 

musculus 


open reading frame (196 AA) 


83 


24 


741 


W03626 


Homo sapiens 


Human thyrotropin GPR N-terminal sequence. 


118 


40 


742 


U66059 


Homo sapiens 


V_segment translation product 


614 


100 


743 


AF1 19815 


Homo sapiens 


G-protein-coupled receptor 


2751 


99 


744 


XI 6663 


Homo sapiens 


haematopoietic lineage cell protein (AA 1-486) 


148 


93 


745 


W67838 


Homo sapiens 


Human secreted protein encoded by gene 32 
clone HLTCJ63. 


448 


95 


746 


W57260 


Homo sapiens 


Human semaphorin Y. 


2414 


100 


747 


W21578 


Homo sapiens 


Alzheimer's disease protein encoded by DNA 
from plasmid pGCS2232. 


968 


65 


748 


Y94935 


Homo sapiens 


Human secreted protein clone yd218_I protein 
sequence SEQ ID NO:76. 


622 


100 


749 


AL022238 


Homo sapiens 


dJ1042K10.5 (novel protein) 


314 


85 


750 


G03889 


Homo sapiens 


Human secreted protein. SEQ ID NO: 7970. 


391 


87 



120 



3DOCID: <WO 0157168A2J_> 



WO01/57188 — — PCT/USO./03800 



SEQ 
ID 

NO: 



Accession 
No. 



Species 



Description 



Smith- 
Waterman 
Score 



% 

Identity 



751 



752 



753 
754 



755 



756 



757 



758 



759 



760 



761 



762 



763 
764 



AB025258 



Mus 

musculus 



granuphilin-a 



Y52386 



Homo sapiens^ 



Human transmembrane protein HP02000. 



900 



Y4S586 



Homo sapiens 



Human breast tumour-associated protein 47. 



2527 



AJ272207 



Homo sapiens 



putative G protein-coupled receptor 92 



694 



M85183 



Rattus 
norvegicus 



vasopressin receptor 



AF 19050 I 



Homo sapiens 



leucine-rich repeat-containing G protein-coupled 
receptor 6 



388 



Y02692 



Homo sapiens 



"Human secreted protein encoded by gene 43 
clone HTADX1 7. . 



Z22535 



Homo sapiens 



R04932 



Homo sapiens 



W74902 



Homo sapiens 



G03706 



AB020676 



Homo sapiens 
Homo sapiens 



AK026992 



Homo sapiens 



ALK-3 



439 



Interferon-gamma receptor segment from clone 
39 responsiblefor binding the target. 



564 



Human secreted protein encoded by gene 175 
clone HE8BI92. 



1217 



Human secreted protein, SEQ ID NO: 7787. 



223 



KIAA0869 protein 



4433 



unnamed protein product 



2285 



99 



99 



100 



68 



71 



87 



98 



97 



99 



99 



99 



765 



766 



767 



768 



769 



770 



771 



772 



773 



774 



775 



776 



777 



778 



779 



780 



781 



782 



783 



791 



792 



793 



794 



795 



AF17335S 



Homo sapiens 



glucocorticoid receptor AF-1 coactivator- 1 



573 



AF268066 



Y48585 



Mus 

musculus 



netrin 4 



AF23037S 



AM 2 1975 



AB008515 



Y09945 



AF226731 
Y27132 



Homo sapiens 



Mus 
musculus 



Hum an breast tumour-associated protein 46. 
interleukin-1 del La 



Mus 

musculus 



odorant receptor S 1 8 



Homo sapiens 



RanBPM 



Rattus 
norvegicus 



putative integral membrane transport protein 



Homo sapiens 



AD026 



Homo sapiens 



X87832 



AB025258 



AF125101 



G02815 



G02493 



R03301 



AL357374 



Human glioblastoma-dcrived polypeptide (clone 
OA004FG). _ 



Homo sapiens 



NOV/plexin-Al protein 



2019 
TT69" 



309 



611 



458 



688 



1384 



1821 



Mus 

musculus 



granuphilin-a 



Homo sapiens 



HSPC040 protein 



Homo sapiens 



Human secreted protein, SEQ ID NO: 6896 



Homo sapiens 



Human secreted protein, SEQ ID NO: 6574. 



Homo sapiens 



AF 100346 



Y19566 



Y36233 



AF084464 



Homo sapiens 



Sequence of pre-human atrial natriuretic peptide. 



bA353C18.2 (novel protein) 



Homo sapiens 



neuronal voltage gated calcium channel gamma- 
3 subunit 



Homo sapiens 



Homo sapiens 



Rattus 
norvegicus 



Y08639 



Y41706 



AF121228 



G04072 



Y69384 



W40215 



Homo sapiens 



Homo sapiens ■ r * . — 

Homo sapiens thyroid hormone receptor-associated protein 
complex component TRAP95 



Amino acid sequence of a human secreted 



protein. . _ 

Human secreted protein encoded by gene 10. 



GTP-binding protein REM2 



Human low density lipoprotein binding protein 




500 



314 



191 



213 



>32 



1434 



103 



1098 



141 



2693 



nuclear orphan receptor RQR-beta 



Human PRQ381 protein sequence 



Homo sapiens 



Homo sapiens 



Homo sapiens 



Human secreted protein, SEQ ID NO: 8153. 



Amino acid sequence of a 14274 receptor 
protein. 



Human macrophage antigen^ 



1421 
644 



1037 



124 



119 



1358 



89 



89 



45 



57 



50 



99 



100 



98 



41 



93 



95 



68 



45 



100 



89 



52 



93 



30 



99 



95 
99 



100 



62 



100 



99 



121 



MSDOCID <WO 01571 68A2_I_> 



WO 01/57188 




'CT/US0 1/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identitv 


797 


AF258340 


Homo sapiens 


hepatocellular carcinoma-associated antigen 1 12 


1151 


99 


798 


AF159615 


Homo sapiens 


FGF receptor activating protein 1 


461 


98 


799 


Y59863 


Homo sapiens 


Human normal uterus tissue derived protein 26. 


797 


99 


800 


W70459 


Homo sapiens 


Human Tl -receptor ligand HI splice variant 2. 


572 


92 


801 


LO0073 


Homo sapiens 


renin 


1913 


93 


802 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


1 1963 


97 


803 


X15357 


Homo sapiens 


ANP-A receptor preprotein (AA -32 to 1029) 


5199 


98 


804 


W64473 


Homo sapiens 


Human secreted protein from clone EC1 72 1. 


4018 


95 


805 


AJ243874 


Homo sapiens 


ol igophrenin-4 


2067 


100 


806 


G01731 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5812. 


284 


100 


807 


Z24680 


Homo sapiens 


garp 


1562 


83 


808 


AF171669 


Homo sapiens 


glycoprotein-associated amino acid transporter 
LAT2 


1364 


90 


809 


W70321 


Homo sapiens 


Secreted Drotein CC 198 1 


1 154 


96 


810 


W74S43 


Homo sapiens 


Human secreted protein encoded by gene 1 1 5 
clone HOVBA03. 


855 


99 


811 


AF 108831 


Homo <mnien<; 


TC'f"'! potran^norfrpr 3 


4561 


100 


812 


AF092135 


Homo <sanif»n<; 

1 1UI1IU DCl^J 1^.110 


PTD0 1 4 




i nn 


813 


AF2S3772 


Homo sapiens 


similar to Homo Qaoif*n^ riho^omal orotpin Tin 

encoded by GenBank Accession Number 
L25899 


784 


100 


814 


GO 1563 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5644. 


330 


100 


S15 


AF051151 


Homo sapiens 


ToII/interleukin-1 receptor-like protein 3 


3850 


99 


816 


W95630 


Homo sapiens 


Homo sapiens secreted protein gene clone 
gnl!4 1. 


358 


100 


817 


GO 1 082 


Homo sapiens 


Human secreted protein, SEQ [D NO: 5163. 


549 


100 


818 


AF151800 


Homo sapiens 


CGI-41 protein 


1106 


95 


819 


L00352 


Homo sapiens 


low densitv lipoprotein receptor 


3980 


100 


820 


X04434 


Homo sapiens 


IGF-I receptor 


5832 


99 


821 


G03844 


Homo Wni^ns 


Human <5t»rrf*teH nrnt<*in 'sFO TD "MCV 




inn 


822 


AF2 122?0 


Homo «;anipn? 


TERA 






823 


Y50125 


Homo <;anien^ 


Human crlvoonhocnhatirlvlinn^i tol-anphorf*H 

protein GPI- 122. 


4897 


QU 


S24 


AF1 56778 


Homo SRoipns 


ASH-'} nrotein 


2675 


PR 


825 


AF096322 


Homo ^aniefTs 


npiironal voltatfp-onlt*ri f*alpinm rharinf 1 onmma- 

UvUlUlJUI WilUgL'gdL\.U ^IIUUIJIL-I ^Cllillllu 

2 subunit 


1 1 05 


inn 


826 


Y07972 


Homo sapiens 


Human secreted protein fragment $ 2 encoded 
from gene 28. 


1540 


100 


827 


AB032013 


Homo sapiens 


potassium channel K.v8.1 


2435 


95 


828 


Y 13620 


Homo sapiens 


BCL9 


5284 


96 


829 


Y91474 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 24 SEQ ID NO:147. 


541 


98 


830 


X54232 


Homo sapiens 


glypican 


1625 


87 


831 


X14830 


Homo sapiens 


acetylcholine receptor beta-subunit preprotein 


2540 


100 


832 


Y71262 . 


Homo sapiens 


Human chondromodulin-like protein, Zchmi. 


1002 


100 


833 


G03S73 


Homo sapiens 


Human secreted protein. SEQ ID NO: 7954. 


638 


96 


834 


AC0O3O3O 


Homo sapiens 


R29828_l 


1389 


93 


835 


Y3S422 


Homo sapiens 


Human secreted protein. 


964 


87 


836 


U41557 


Caenorhabditi 
s elegans 


glycine-rich 


85 


36 


837 


AL121889 


Homo sapiens 


dJ1076E17. 1 (K1AA0823 protein (continues in 
AL023803)) 


998 


75 


838 


AJ011415 


Homo sapiens 


plexin-Bl/SEP receptor 


1580 


60 


839 


W80398 


Homo sapiens 


A secreted protein encoded by clone cwl543 3. 


1 105 


67 


840 


G00862 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4943. 


255 


92 


841 


G02650 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6731. 


644 


97 


842 


AF036717 


Homo sapiens 


FGFR signalling adaptor SNT-1 


2629 


99 


843 


Y73446 


Homo sapiens 


Human secreted protein clone yc27_l protein 
sequence SEQ ID NO: 1 1 4. 


1089 


100 


844 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


357 


69 


S45 


AF15I810 


Homo sapiens 


CGI-52 protein 


1443 


8S 


846 


X83378 


Homo sapiens 


putative chloride channel 


1620 


99 


847 


AC004883 


Homo sapiens 


similar to general transcription factor 21; similar 


655 


96 



122 



SDOCID: <WO 0157188A2_L> 



WO 01/57188 



PCT/U SO 1/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 








to Ar03i5969 (PlU.gZaZ/zu /; 






848 


X998S6 


Homo sapiens 


monocyte chemotactic protein-2 


160 


76 


849 


AC005587 


Homo sapiens 


similar to mouse olfactory receptor 13; similar to 
P349S4 (PJD:g464Jlo; 


963 


98 


850 


AB038237 


Homo sapiens 


G protein -coupled receptor C5E.2 


1767 


100 


851 


AF 124490 


Homo sapiens 


ARF GTPase-activating protein GIT1 


3415 


98 


852 


YS6217 


Homo sapiens 


Human secreted protcm hwikjUD**, c»la<> lu 
NO. 132. 


1 IRQ 


"99 


853 


AF224741 


Homo sapiens 


chloride channel protein 7 


3748 


99 


854 


X17094 


Homo sapiens 


iunn (AA l-7y4) 


3550 


99 


855 


W78245 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 19. 


1245 


99 


856 


R97569 


Homo sapiens 


lnterleukin-2 receptor associated protein p43. 


1926 


100 


857 


Y41765 


Homo sapiens 


Human PRO1083 protein sequence. 


321 1 


99 


858 


AF057306 


Homo sapiens 


transmembrane proteolipid 


4R1 


84 


859 


AK025116 


Homo sapiens 


unnamed protein product 


lid 


69 


860 


Y41312 


Homo sapiens 


Human secreted protein encoded by gene 5 clone 
HLDRM43. 


824 


100 


862 


Y25776 


Homo sapiens 


Human secreted protein encoded from gene 66. 




99 


863 


Y74188 


Homo sapiens 


Human prostate tumor bb 1 iragment aenvea 
protein #375. 




30 


864 


AF 167473 


Homo sapiens 


heme-binding protein 




99 


865 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: oolJ. 


— J J 


67 


866 


X54870 


Homo sapiens 


Type 11 integral membrane protein 




100 


867 


G00700 


Homo sapiens 


Human secreted protein, SEQ ID NU. 4/61. 




99 


868 


Y07894 


Homo sapiens 


Human secreted protein fragment encoded from 
gene 43. 


ICQ 


RR 


869 


J00123 


Homo sapiens 


preproenkephalin ( 


1349 


95 


870 


Y91632 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 25 SEQ ID NO:305. 


1048 


98 


871 


L04311 


Homo sapiens 


GABA-alpha receptor beta-3 subunit 


237 


93 


872 • 


Y29988 


Homo sapiens 


Human cytokine family member EF-7 protein. 


960 


94 j 


873 


AF161382 


Homo sapiens 


HSPC264 


1 1 7d 


99 


874 


G03412 


Homo sapiens 


Human secreted protein, btQ ID NU. /4yj. 


HD't 


100 


875 


Y27572 


Homo sapiens 


Human secreted protein encoded by gene No. 6. 


^73 


96 


876 


M15530 


Homo sapiens 


B-cell growth factor 


171 
ill 


56 


877 


W63681 


Homo sapiens 


Human secreted protein 1 . 


1 fiS7 


99 


878 


L27867 


Rattus 
norvegicus 


neurexophilin 
- — ■ 


1448 


98 


879 


Y10835 


Homo sapiens 


Amino acid sequence of a human secreted 
protein. 




100 


880 


W88991 


Homo sapiens 


Polypeptide fragment encoded by gene 144. 


936 


100 


881 


AF 11 8670 


Homo sapiens 


orphan G protein-coupled receptor 


1971 


100 


882 


AF208865 


Homo sapiens 


EDRF 




100 


883 


Y 18462 


Homo sapiens 


cathepsin L 


209 


72 


SS4 


Y94950 


Homo sapiens 


Human secreted protein clone dhl073_12 protein 
sequence SE<sJ inhj.aUo. 


348 


100 


885 


AF070661 


Homo sapiens 


HbrCOUD 


404 


100 


886 


Y04315 


Homo sapiens 


11 urn on secrcica proLciTi cni*uuczu uy gcuc ^-j ■ 


385 


100 


887 


X92744 


Homo sapiens 


hBD-l 


375 


100 


888 


Y22496 


Homo sapiens 


Human secreted protein sequence clone 
cn621 8. 


994 


94 


889 


Y41293 


Homo sapiens 


Human soluble protein ai jvltu-i. 


4595 


99 


890 


G03714 


Homo sapiens 


Human secreted protein, bc,v jnu. / /yj. 


147 


63 


891 


AF208856 


Homo sapiens 


BM-014 


1017 


99 






nuniu io.pitiio 


neuronal pentraxin 11 


2002 


98 


893 


X68149 


Homo sapiens 


Burkitt lymphoma receptor 1 


1953 


100 


894 


Y94914 


Homo sapiens 


Human secreted protein clone p\v337_6 protein 
sequence SEQ ID NO:34. 


537 


100 


895 


W61630 


Homo sapiens 


Clone HNFGW06 of EGFR receptor family. 


326 


63 


896 


M24110 


Homo sapiens 


G0S19-2 peptide precursor 


481 


100 


897 


Z68747 


Homo sapiens 


imogen 38 


2018 


99 


898 


AF1861I2 


Homo sapiens 


neurokinin B-Iike protein ZNEUROKl 


619 


100 


899 


AF225420 


Homo sapiens 


AD025 


734 100 



123 



MSDOCID- <WO 0157188A2J_> 



WO 01/57188 




CT/US01/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


900 


P60657 


Homo sapiens 


Sequence of human lipocortin. 


1835 


100 


901 


M2728S 


Homo sapiens 


oncostatin M 


1297 


99 


902 


W85737 


Homo sapiens 


Polypeptide with transmembrane domain. 


749 


100 


903 


GO 1349 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5430. • 


650 


99 


904 


Y00261 


Homo sapiens 


Human accreted protein encoded by gene 4. 


1133 


99 


905 


AF039688 


Homo sapiens 


antigen NY-CO-3 


771 


99 


906 


AB007836 


Homo sapiens 


Hic-5 


2544 


100 


907 


AB017507 


Homo sapiens 


Apgl2 


224 


100 


908 


AK000056 


Homo sapiens 


unnamed protein product 


1537 


98 


909 


Y86299 


Homo sapiens 


Human secreted protein HFOXB55, SEQ ID 
NO:214. 


427 


100 


910 


AF23I023 


Homo sapiens 


protocadherin Flamingo 1 


7393 


99 


911 


Y14134 


Homo sapiens 


Vascular endothelial cell growth inhibitor beta 
protein sequence. 


1319 


100 


912 


Z90420 


Homo sapiens 


Human GDF-3 (hGDF-3) polypeptide encoding 
cDNA. 


1950 


100 


913 


Y 19757 


Homo sapiens 


SEQ ID NO 475 from W09922243. 


1361 


100 


914 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7253. 


112 


48 


915 


U14971 


Homo sapiens 


ribosomal protein S9 


886 


90 


916 


AF 1 72854 


Homo sapiens 


card iotroph in-like cytokine CLC 


1204 


99 


917 


AC005525 


Homo sapiens 


F22162_l 


1963 


100 


918 


AF 166350 


Homo sapiens 


ST7 protein 


4711 


99 


919 


Y87285 


Homo sapiens 


Human signal peptide containing protein HSPP- 
62 SEQ ID NO:62. 


430 


100 


920 


Y36131 


Homo sapiens 


Human secreted protein #3. 


465 


88 


921 


AF193766 


Homo sapiens 

i- 


cytokine-like protein CI 7 


724 


100 


922 


Y95013 


Homo sapiens 


Human secreted protein vc48_l, SEQ ID NO:66. 


357 


100 


923 


X7520S 


Homo sapiens 


protein tyrosine kinase-receptor 


5256 


100 


924 


Y96202 


Homo sapiens 


IkappaB kinase (JKK) binding protein, Y2H56. 


813 


98 


925 


AB039886 


Homo sapiens 


down-regulated in gastric cancer 


785 


78 


926 


G03368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7449. 


55 


50 


927 


Y48606 


Homo sapiens 


Human breast tumour-associated protein 67. 


539 


100 


928 


Y36151 


Homo sapiens 


Human secreted protein #23. 


668 


100 


929 


AF1 10399 


Homo sapiens 


elongation factor Ts 


1666 


100 


930 


AF210317 


Homo sapiens 


facilitative glucose transporter family member 
GLUT9 


2763 


99 


931 


Y73328 


Homo sapiens 


HTRM clone 082843 protein sequence. 


931 


100 


932 


G01959 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6040. 


274 


100 


933 


U47924 


Homo sapiens 


B-cell receptor associated protein 


1469 


100 


934 


G03827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7908. 


529 


93 


935 


AB039371 


Homo sapiens 


mitochondrial ABC transporter 3 


196 


63 


936 


X56385 


Canis 
familiaris 


rab8 


1064 


100 


937 


B08906 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 16 SEQ JD NO:63. 


117 


44 


938 


Ml 3692 


Homo sapiens 


alpha- 1 acid glycoprotein precursor 


1064 


99 


939 


Y53886 


Homo sapiens 


A suppressor of cytokine signalling protein 
designated HSCOP-6. 


515 


42 


940 


Y 16630 


Homo sapiens 


Human Putative Adrenomedullin Receptor 
(PAR). 


1904 


99 


941 


AC005102 


Homo sapiens 


small inducible cytokine subfamily A member 
24 


627 


99 


942 


M12886 


Homo sapiens 


T-cell receptor beta chain 


1289 


81 


943 


AF226046 


Homo sapiens 


GK003 


1049 


98 


944 


Y36078 


Homo sapiens 


Extended human secreted protein sequence, SEQ 
ID NO. 463. 


667 


100 ! 


945 


M22877 


Homo sapiens 


cytochrome c 


565 


100 


946 


W67869 


Homo sapiens 


Human secreted protein encoded by gene 63 
clone HHGDB72. 


551 


93 


947 


W67S59 


Homo sapiens 


Human secreted protein encoded by gene 53 ■ 
clone HBMCL41. 


283 


100 


948 


W85726 


Homo sapiens 


Novel protein (Clone BG33J7). 


789 


100 


949 


AJ242015 


Homo sapiens 


eMDC II protein 


4236 


100 


950 


G04075 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8156. 


567 


99 



124 



3 DOC ID: <WO 0157168A2J_> 



WO 01/57188 



PCT/US01/0380M 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


7o 

Identity 


951 


AF 110645 


Homo sapiens 


candidate tumor suppressor p33 ING1 homolog 


li 14 


1 nn 

1UU 


952 


Y36111 


Homo sapiens 


Extended human secreted protein sequence, SEQ 
ID NO. 496. 


402 


70 


953 


AB012109 


Homo sapiens 


APC10 


991) 


i fin 


954 


AF246221 


Homo sapiens 


transmembrane protein BRJ 


i /i n< 
1405 


i aa 

1UU 


955 


AF054986 


Homo sapiens 


putative transmembrane GTPase 


18S3 


1UU 


956 


W74726 


Homo sapiens 


Human secreted protein fg949 3. 


lo/9 


1 AA 


957 


Y27096 


Homo sapiens 


Human viral receptor protein (ACVRP). 


1 CO 1 

15M 


1 AA 


958 


AJ222967 


Homo sapiens 


cystinosin 


1920 


100 


959 


Y53052 


Homo sapiens 


Human secreted protein clone df202_J5 protein 
sequence SEQ ID NO: 110. 


587 

- 


i nn 
1UU 


960 


G02694 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6775. 


283 


1 An 

1VJU 


961 


AF 15 1855 


Homo sapiens 


CG1-97 protein 


1214 


yo 


962 


U26592 


Homo sapiens 


diabetes mellitus type I autoantigen 


250 




963 


AL050306 


Flomo sapiens 


dJ475B7.2 (novel protein) 


3796 


1 AA 


964 


AF078859 


Homo sapiens 


PTD004 


2089 




965 


AB020315 


Homo sapiens 


homologue of mouse dkk-1 gene:Acc# 
AF030433 


1466 


1 nn 
lUU 


966 


X04571 


Homo sapiens 


precursor polypeptide (AA -22 to 1 185) 


6580 


on 

yy 


967 


AF 146019 


Homo sapiens 


hepatocellular carcinoma antigen gene 520 


993 


yy 


968 


AF071002 


Homo sapiens 


minK-related peptide 1; MiRPl 


632 


i An 


969 


AB021227 


Homo sapiens 


rnembrane-type-5 matrix metal loproteinasc 


3545 


i nn 


970 


AF 180920 


Homo sapiens 


cyclin L ania-6a 


1579 


1 nn 
100 


971 


AF 105365 


Homo sapiens 


K-Cl cotransporter KCC4 


5621 


99 


972 


AF083248 


Homo sapiens 


ribosomal protein L26 homolog 


739 


100 


973 


AJ 132429 


Homo sapiens 


hyperpolarization-activated cyclic nucleotide 
gated cation channel hHCN4 


6295 


100 


974 


W61619 


Homo sapiens 


Clone HTPEF86 of TM4SF superfamily. 


454 


100 


975 


AF155100 


Homo sapiens 


zinc finger protein NY-REN-21 antigen 


2261 


100 


976 


AF275948 


Homo sapiens 


ABCA1 1 


11763 


99 


977 


AB026891 


Homo sapiens 


cystine/glutamate transporter 


2552 


100 


978 


AF 117657 


Homo sapiens 


thyroid hormone receptor-associated protein 
complex component TRAP80 


3348 


99 


979 


AF044201 


Rattus 
norvegicus 


neural membrane protein 35; NMP35 


1570 


92 


980 


AF 119297 


Homo sapiens 


neuroendocrine-specific protein-like protein 1 


1170 


99 


981 


AF 155652 


Homo sapiens 


potassium channel modulatory factor 


1983 


99 


982 


W88499 


Homo sapiens 


Human stomach carcinoma clone HP10412- 
encoded protein. 


1553 


99 


983 


Z56281 


Homo sapiens 


interferon regulatory factor 3 


2012 


98 


984 


AB026125 


Homo sapiens 


ART-4 


2160 


100 


985 


Y 14482 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 17. 


172 


70 


986 


AB023888 


Homo sapiens 


b-chemokine receptor CCR4 


1895 


100 


987 


W27291 


Homo sapiens 


Human HI 07 5-1 secreted protein 5' end. 


712 


100 


988 


AF 153450 


Manduca 
sexta 


juvenile hormone esterase binding protein 


226 


32 


989 


G03697 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7778. 


194 


oo 
OO 


990 


AF204159 


Homo sapiens 


potassium large conductance calcium-activated 
channel beta 3a subunit 


14S6 


100 


991 


G02061 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6142. 


558 


yy 


992 


AL031266 


Caenorhabditi 
s elegans 


VM106R.1 


327 


40 


993 


Y66749 


Homo sapiens 


Membrane-bound protein PROl 124. 


4730 


nn 


994 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5327. 


141 


77 


995 


AF133845 


Homo sapiens 


corin 


con 
DiSl 1 


yy 


996 


AF1 17756 


Homo sapiens 


thyroid hormone receptor-associated protein 
complex component TRAP 1 50 


4999 


100 


997 


W62066 


Homo sapiens 


Human stem cell antigen 2. 


284 


93 


998 


Y87173 


Homo sapiens 


Human secreted protein sequence SEQ ID 
NO:212. 


725 


100 


999 


Y 13379 


Homo sapiens 


Amino acid sequence of protein PR0263. 


1654 


99 


1000 


Y95008 


Homo sapiens 


Human secreted protein vf3 1, SEQ ID NO:56. 


676 


47 


1001 


AF190167 


Homo sapiens 


membrane associated protein SLP-2 


1747 


100 



125 



MSDOCID: <WO 0157188A2_I_> 



WO 01/57188 




'CT/US0 1/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


1002 


GO 1234 


Homo sapiens 


Human secreted protein, SEQ ID NO: 53 15. 


398 


96 


1003 


W73420 


Homo sapiens 


Human secreted protein encoded by Gene No. 
24. 


2150 


100 


1004 


X12791 


Homo sapiens 


1 9JcD SRP-protein (AA 1 - 1 44) 


742 


100 


1005 


M23323 


Homo sapiens 


membrane protein 


642 


100 


1006 


X63745 


Homo sapiens 


KDEL receptor 


326 


98 


1007 


Y35997 


Homo sapiens 


Extended human secreted protein sequence, SEQ 
ID NO. 382. 


824 


99 


1008 


AB032918 


Hylobates 
moloch 


dopamine receptor D4 


92 


35 


1009 


Y91680 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 81 SEQ ID NO:353. • 


1372 


99 


1010 


AL136125 


Homo sapiens 


dJ304B14.1 (novel protein) 


825 


98 


101! 


G03733 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7814. 


379 


98 


1012 


Y17531 


Homo sapiens 


Human secreted protein clone BL205 14 protein. 


81S 


97 


1013 


G00724 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4805. 


462 


100 


1014 


AF2S8092 


Naegleria 
gruberi 


haem lyase 


1 14 


37 


1015 


AB045292 


Homo sapiens 


M83 protein 


3867 


99 


1016 


XI 5940 


Homo sapiens 


ribosomal protein L31 (AA 1-125) 


644 


100 


1017 


Y94873 


Homo sapiens 


Human protein clone HP02632. 


1876 


100 


1018 


AL024498 


Homo sapiens 


dJ417M14.1 (novel protein) 


589 


100 


1019 


XS3425 


Homo sapiens 


Lutheran blood group glycoprotein 


3054 


99 


1020 


W03516 


Homo sapiens 


Prostaglandin DP receptor. 


1 864 


100 


1021 


G03960 


Homo sapiens 


Human secreted protein, SEQ ID NO: S041. 


398 


100 


1022 


Y91689 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 93 SEQ ID NO:362. 


768 


100 


1023 


AE000660 


Homo sapiens 


hADV36Sl 


573 


100 


1024 


AF 132965 


Homo sapiens 


CGI-31 protein 


1550 


100 


1025 


W92380 


Homo sapiens 


Human TR-interacting protein Si 03a. 


1466 


97 


1026 


R66278 


Homo sapiens 


Therapeutic polypeptide from glioblastoma cell 
line. 


S30 


100 


1027 


X65614 


Homo sapiens 


SI OOP calcium-binding protein 


476 


100 


102S 


Y41741 


Homo sapiens 


Human PR.O704 protein sequence. 


1323 


100 


1029 


AJ0010I4 


Homo sapiens 


RAMP1 


806 


100 


1030 


W63682 


Homo sapiens 


Human Qp.rr£*tpri nrotfin 7 

J. A-U1I1&L11 JvV/l ^LVyU \J 1 KJ Lv ill A» • 


1354 


00 


1031 


AK023007 


Homo sapiens 


unnamed protein product 


766 


100 


1032 


W97900 


Homo sapiens 


Human SR-BI class B scavenger. 


2672 


99 


1033 


Y82453 


Homo saniens 


Human TGC-440 secretorv nrotein SFO TD 
NO:l. 


639 




1034 


Y73473 


Homo sapiens 


Human secreted protein clone ydl78_l protein 
sequence SEQ ED NO: 168. 


752 


93 


1035 


Y86468 


Homo sapiens 


Human gene 48-cncodcd protein fragment, SEQ 
IDNO:383. 


96 


90 


1036 


U09813 


Homo sapiens 


mitochondrial ATP synthase subunit 9 precursor 


698 


100 


1037 


AJ242832 


Homo sapiens 


calpain 


3699 


99 


1038 


X66403 


Homo sapiens 


acetylcholine receptor epsilon subunit CHRNE 


2574 


100 


1039 


AJ242730 


Homo sapiens 


polyhomeotic 2 


1310 


100 


1040 • 


AF 169968 


Mus 

musculus 


DNA binding protein DESRT 


1453 


80 


1041 


X52563 


Bos taurus 


pcrmability increasing protein 


383 


29 


1042 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4449. 


75 


50 


1043 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6613. 


60 


53 


1044 


M94582 


Homo sapiens 


interleukin 8 receptor B 


1850 


100 


1045 


AL080">39 


Homo ^anif*n^ 


hfP'SfiO?'} 1 Similar tn TfrF AT 9 /inciilin-1 iLtr 

growth factor binding protein, acid labile 
subunit)) 






1046 


AF125101 


Homo sapiens 


HSPC040 protein 


580 


100 


1047 


W74809 


Homo sapiens 


Human secreted protein encoded by gene 8 1 
clone HMWDN32. 


176 


100 


1048 


AL022238 


Homo sapiens 


dJ 1 042K 1 0.4 (novel protein) 5 


2201 


100 


1049 ■ 


W88667 


Homo sapiens 


Secreted protein encoded by gene 1 34 clone 
HAIBP89. 


1559 


99 


1050 


AF097518 


Homo sapiens 


liver-specific transporter 


2820 


100 



126 



30OCID: <WO 0157l88A2J_> 



WO 01/57188 — ™ PCT/US01/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


1051 


W78324 j 


Homo sapiens 


rragment oi numan seureicu pioitm ui^uu^u 
gene 81. 


1318 


98 


1052 


Y21851 


Homo sapiens 


Human signal peptidc-contianing protein (SIGP) 
(clone ID 2328134). 


1643 


95 


1053 


AL163815 


Arabidopsis 
thaliana 


putative protein 

Human secreted protein encoded by gene 77. 


661 
262 


62 
100 


1054 
1055 


Y76200 
AJ276567 


Homo sapiens 
Homo sapiens 


TClO-likeRhoGTPase 


1160 


100 


1056 


Y27620 


Homo sapiens 


Human secreieo protein chiajuvai \jy g^ 1 >>V7. ^~r. 


154 


96 


1057 


D14530 


Homo sapiens 


ribosomal protein 


745 
1132 


100 
100 


1058 
1059 

1060 
1061 
1062 


AP132000 
AL031778 

AF227135 

Y27575 

Z11697 


Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 


TADA1 protein 

dJ34B21, 1 (novel BZRP (benzodiuzapine 

r»f«mtnr f nfrinhpran (MBR PBR PBKS IBP, 

recepior ipcripiicicti^ ^ivlda, i x wi^uj xj — ' a > 
Isoquinoline-binding protein)) LIKE protein) 
canaiaate taste receptor 1 ~jsjy 
Human secreted protein encoded by gene No. 9. 
HB 1 5 


920 

134 

1392 

108S 


100 

33 

100 

100 


1063 
1064 
1065 
1066 


AF123757 
AF155135 
Y41674 
AJ250042 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


putative transmembrane protein 
novel retinal pigment epithelial cell protein 
Human channel-related molecule HCRM-2. 
Rab5 GDP/GTP exchange factor homologue 


819 
2932 
936 
2575 


100 
99 
99 
100 


1067 


Y36087 


Homo sapiens 


Extended human secreted protein sequence, SEQ 
ID NO. 472. 


770 


85 


1068 


Y94959 


Homo sapiens 


Human secreted protein clone mc300__l protein 
sequence SEQ ID NO: 124. 


301 


100 


1069 


Y94959 


Homo sapiens 


Human secreted protein clone mc300_l protein 

sequence SEQ ID NO: 124. 

Human leukocyte cell clone HP00804 protein. 


301 
2014 


100 

99 


1070 
1071 
1072 
1073 
1074 


W64535 

X03145 

AL031177 

X82200 

G0321.3 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


pot. ORE in 

dJ889M15.3 (novel protein) 
gpbtaroU 

Ui, m o n c-0.<~rf*if^A r»rrvtein ^sFO ID NO" 7294 

Human secreieo proicm, dc^v 1L7 .i^v.*. i 


148 
821 
249 
99 


50 
91 
62 
47 


1075 
1076 
1077 
1078 


Y36233 
G03187 
L25899 
Y91447 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


Human secreteu proiein encuueu uy gcut i u. 
Human secreted protein, SEQ ID NO: 7268. 
ribosomal protein L10 

Human secreted protein sequence encoded by 
gene 48 SEQ ID NO: 168. 

rj,,nicn c*>^f<»f^W nrnf-pin <sFD I O MO" 5943 


506 
424 
332 
898 

290 


55 
98 
76 
97 

89 


1079 
1080 


GO 1862 
AB039723 


Homo sapiens 
Homo sapiens 


Human secreteu proicm, jz>-*j. 
WNT receptor frizzled-3 


1376 


92 


1081 


AB020527 


Homo sapiens 


rva/rL/' i f cotransponcr iiuiiiunjg, 


269 


100 


1082 


L13802 


Homo sapiens 


ribosmal protein small subunit 


499 


80 


10S3 


W75098 


Homo sapiens 


Human secreted protein encoded by gene 42 
clone rio/\.oi«ij. 

Human secreted protein, SEQ ID NO: 7645. 


143 
83 


81 
51 


1084 
1085 
1086 


G03564 
G04063 
AF090942 


Homo sapiens 
Homo sapiens 
Homo sapiens 


n„ m in cr^rrvr^/H nrntpin ST 3 O TO NO' 8144 

Human secreteu pruLciii, jj-'V iiv urn. 


88 
124 


43 
64 


1087 
1088 


G00517 
G04091 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 4598. 
Human secreieo protein, jcy iu oj /«. 


129 
126 


41 

36 


1089 


AF 140631 


Homo sapiens 


G-protein coupled receptor 14 

Human secreted protein, SEQ ID NO: 8144. 


364 
114 


S2 
32 


1090 
1091 


G04063 
S72304 


Homo sapiens 
Mus sp. 


LMW G-protein 


146 


83 


1092 


W88708 


Homo sapiens 


Secreted protein encoded by gene 175 clone 
HEMAM41. 


405 


100 


1093 


W85612 


Homo sapiens 


Secreted protein clone thl23 5. 


4358 


97 


1094 


Y53012 


Homo sapiens 


Human secreted protein clone pm514_4 protein 
sequence SEQ ED NO:30. 


1013 


99 


1095 
1096 


Y92345 
AF090942 


Homo sapiens 
Homo sapiens 


Human cancer associated antigen precursor from 

clone NY-REN-62. 

PRO0657 

transformation-related protein 


409 

147 
166 


100 

60 
58 


1097 
1098 
1099 


L24521 
X56932 
G04063 


Homo sapiens 
Homo sapiens 
Homo sapiens 


23 kD highly basic protein 

Human secreted protein, SEQ ID NO: 8144. 


490 
83 


70 
35 


1100 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


149 


59 



127 
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SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


1101 


AF1 19851 


Homo sapiens 


PRO 1722 


183 


72 


1102 


G04086 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8167. 


207 


62 


1103 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8144. 


91 


52 


1104 


X74856 


Mus 

musculus 


ribosomal protein L28 


128 


69 


1105 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


130 


62 


1106 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7214. 


122 


48 


1107 


G03040 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7121. 


69 


43 


1108 


AF039942 


Homo sapiens 


HCF-binding transcription factor Zhangfei 


744 


99 


1109 


AF201951 


Homo sapiens 


high affinity immunoglobulin epsilon receptor 
beta subunit 


738 


94 


1110 


AF1III0S 


Mus 

musculus 


transient receptor potential 2 


223 


79 


111] 


AF 119900 


Homo sapiens 


PR02822 


144 


59 


1112 


Y 16589 


Homo sapiens 


A protein that interacts with presenilins. 


265 


39 


1113 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


178 


67 


1114 


Y02999 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 121. 


164 


63 


111S 


Y30811 


Homo sapiens 


Human secreted protein encoded from gene 1. 


1217 


99 


1116 


X5I394 


Xenopus 
laevis 


APEG precursor protein 


130 


40 


1117 


M27826 


Homo sapiens 


neutral protease large subunit 


442 


65 


1118 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7452. 


72 


60 


1119 


G03602 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7683. 


491 


97 


1120 


Y35906 


Homo sapiens 


Extended human secreted protein sequence, SEQ 
ID NO. 155. 


244 


97 


1121 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7795. 


122 


65 


1122 


Y00337 


Homo sapiens 


Human secreted protein encoded by gene 8] . 


110 


90 


1123 


AFOS4S30 


Homo sapiens 


two pore domain K> channel; TASK-2 


703 


94 


1124 


AF2 12862 


Homo sapiens 


membrane interacting protein of RGS16 


442 


88 


1125 


W64469 


Homo sapiens 


Human secreted protein from clone CW795 2. 


191 


53 


1126 


G01361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5442. 


154 


100 


1127 


G01361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5442. 


165 


100 


1128 


Y84320 


Homo sapiens 


Human cardiovascular system associated protein 
kinase- 1 . 


815 


99 


1129 


G02105 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6186. 


88 


73 


1130 


Y32923 


Homo sapiens 


Transmembrane domain containing protein clone 
HP01512. 


700 


100 


1131 


Y29817 


Homo sapiens 


Human synapse related glycoprotein 2. 


260 


91 


1132 


Y91644 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 43 SEQ IDNO:317. 


525 


96 


1133 


Y91449 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 49-SBQ ID NO: 1 70. 


542 


100 


1134 


ABO 17908 


Homo sapiens 


4F2 light chain 


2399 


93 


1135 


X51760 


Homo sapiens 


zinc finger protein (583 AA) 


312 


55 


1136 


Y99426 


Homo sapiens 


Human PRO 1604 (UNQ785) amino acid 
sequence SEQ ID NO:308. 


917 


72 


1137 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7871 . 


102 


50 


1138 


AF155106 


Homo sapiens 


NY-REN-36 antigen 


768 


91 


1139 


AL031055 


Homo sapiens 


dJ2SH20.1 (novel protein similar to membrane 
transport proteins) 


117 


50 


1140 


AF011359 


Bos taurus 


regulator of G-protein signaling 7 


138 


96 


1141 


Y70018 


Homo sapiens 


Human Protease and associated protein- 1 2 
(PPRG-12). 


623 


100 


1142 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8172. 


113 


38 


1143 


AB030235 


Can is 
familiaris 


D4 dopamine receptor 


89 


48 


1144 


Y94922 


Homo sapiens 


Human secreted protein clone pv6_l protein 
sequence SEQ ID NO:50. 


539 


88 


1145 


X99962 


Homo sapiens 


rab-reJated GTP-binding protein 


39S 


96 


1146 


G03807 


Homo sapiens 


Human. secreted protein, SEQ ID NO: 7888. 


168 


79 


1147 


G03712 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7793. 


512 


85 


1148 


Y28279 


Homo sapiens 


Human G-protein coupled receptor GR1R-1. 


705 


76 


1149 


U136 \2 


Caenorhabditi | 


exon 5 similar to transmembrane domain of S. 


247 


36 
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in 

NO: 


No. 


^ItippiP 5 ; 


Description 


Smith- 
Waterman 
Score 


% 

Identity 






s elegans 


cerevisiae zinc resistance protein 






1 1 so 




Homo sapiens 


Human secreted protein, SEQ ID NO: 7519. 


117 


62 


11-31 




Pnmn canien*? 


Human secreted protein, SEQ ID NO: 5084. 


181, 


80 


1 1 ^7 
1 1 JZ 


G03798 


Hnmo ^aniens 


Human secreted protein, SEQ ID NO: 7879. 


198 


63 


1 1 JJ 


AOO / .7.7 


Orvza sativa 


DNA binding protein 


95 


41 


1 1 S4 
l I 


D85245 


Homo sapiens 


TR3beta 


155 


96 


1155 


R74272 


Homo sapiens 


Tumour suppressor protein, p53. 


341 


87 


1 I DO 


I OOZO-/ 


Hnmrt sar>ipn<i 

lHJtllU Jflpi^Uj 


Human secreted protein HUSXE77, SEQ ID 
NO: 180. 


99 


41 


1157 


G02577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6658. 


263 


98 


1 1 ->r 




Hnmn Qampnc 

I 1 U 1 1 1 V/ Otlplv.Ho 


putative organic anion transporter 


185 


42 


1 1 Dy 


rim "}cn 


T-lr\mr> ^anipnc 


Human secreted protein, SEQ ID NO: 5474. 


173 


57 


1 lOU 


U/7-C771 
W /Dfil 


f-Tnmr» cam'pnc 

J 1LH11VJ ia.L'l»--l I.J 


Human GTP binding protein APD08. 


224 


81 


1161 


AF2 16833 


Homo sapiens 


M-ABC2 protein 


410 


S3 


1 162 


Wo/olo 


nomo sapiens 


Human ^rrpreH nrotfttn encoded bv &enc 10 
clone HCEMU42. 


1156 


100 


1 163 


Ar 1 lyoDl 


Homo sapiens 


PROl 77? 


230 


70 


1 164 


Yo /zjz 


riomo sapiens 


T4i iman clonal npntirlf t^nntaininp Drotein HSPf- 


113 


31 


I 10 J 


W 04JJ / 




Human liver cell clone HP01 148 protein. 


338 


82 


1 1 <iA 

J Joo 


/\rzoyzoo 


ttUllWJ 6apICiI5 


HC6 


134 


64 


1 lo / 


V 1 /1/1Q7 
I 1445Z 


rioiuu bapicnb 


Fragment of human secreted protein encoded by 
gene 17. 


149 


51 


1 1 /CO 

1 loo 




Escherichia 


r>inpnfiHe Iran snort svstern nermease Drotein 
DppC. 


411 


90 




IxOO /o^ 


Hr\mo cnr*ipnQ 


TG0847 protein. 


344 


90 


1 J /U 


I *oz / * 




Human secreted protein encoded from gene 1 8. 


478 


98 


I J /I 


1J04 J 


JiOlJlU idpJCllD 


Mr 1 1 0 000 antigen 


347 


96 


1 I / Z 


/aoUZOZ DO 




organic anion transporter OATP-B 


311 


67 


1 1 7^ 




Hrtmn cjmipnc 
1 li w ^apibiis 


Human secreted protein, SEQ ID NO: 4438. 


60 


52 


1174 


D87717 


Homo sapiens 


similar to human GTPase-activating 
nroteinfA49869) 


178 


59 


1 1 / J 


IvlO^f / ID 


Hr»mn canipnc 
n.LHIlxJ bapltlli 


ribosomal protein 


391 


78 


1 T TA 




14r\mri cjiTiipnc 


Human IL-7 receptor clone H6. 


285 


67 


1177 


LO6505 


Homo sapiens 


ribosomal protein LI 2 


242 


72 


1 I/O 


A 10^1 RQ •> 


noiiiu idpioiib 


nroanir rafinn transnorter TOCX2^ 


276 


88 


1 179 


LjUjZjO 


nOITIO SdjtlCIJS 


Human secreted protein, SEQ ID NO: 7339. 


155 


71 


I J oU 


rim 7D7 


ilOniCJ bdJJiClii 


Human secreted protein, SEQ ID NO: 528S. 


282 


90 


1181 


AF181856 


Rattus 
norvt/giLUo 


tRNA selenocystcine associated protein 


249 


62 


1 1 oZ 


A P 1 A 1 *\*)A 

f\r I O 1 JZ*f 


rlOllli' isdJPIClIi 


HSPP176 


138 


90 


1 1 07 

1 1 ©3 




n.UHHJ aapiciia 


Human secreted protein, SEQ ID NO: 7870. 


282 


66 


I lo4 


■vm£7i 

IUZD / 1 


nOmU bdpiCllb 


Human secreted protein encoded by gene 22 
cloneHMSJW18. 


107 


71 


1 IOC 

J 1 OJ 


vjUj ly i 


Urimrt cjinipnQ 
JiOlJlU iajJJCiii 


Human secreted protein, SEQ ID NO: 7878. 


88 


69 


1186 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7645. 


118 


46 


1 187 


ABUjzVvJj 


riyiooaies 

HJIlt-UllJl 


rtrmaminp rPPf*r»tDr D4 


96 


37 


1 1 RS 

1 loo 


VJUuy JD 


XlL/lllvJ oa^JIV/lEO 


Human secreted protein, SEQ ID NO: 5037. 


292 


78 


I 1 CO 

I I oy 


VJUOZJO 


f-Tnmri QarMPn^i 

11UJ11U ia.JJl<-.HO 


Human secreted protein, SEQ ID NO: 7339. 


178 


79 


1 1 GA 

i iyu 


KJUJ jO\ 


]J|->TT1 0 Cf>r"ltf»nC 


Human secreted protein, SEQ ID NO: 7442. 


324 


76 


i iyi 


API 177^ 
AT 11/ / JJ 


Unmn MniPns 
nuinu aapicilb 


thyroid hormone receptor-associated protein 
complex component TRAP230 


187 


70 


1192 


Y70455 


Homo sapiens 


Human membrane channel protein-5 (MECHP- 
j »■ ... 


202 


67 


1193 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7133. 


99 


42 


1194 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6688. 


192 


76 


1195 


W29661 


Homo sapiens 


Homo sapiens CI542 2 clone secreted protein. 


2001 


98 


1196 


Y14104 


Homo sapiens 


Human GABAB receptor Id protein sequence. 


239 


69 


1197 


X61972 


Homo sapiens 


macropain subunil iota 


149 


90 


1198 


G00534 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4615. 


145 


51 


1199 


YS6260 


Homo sapiens 


Human secreted protein HELHN47, SEQ ID 
NO:175. 


1089 


89 


1200 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6688. 


154 


57 



129 
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ID 

NO: 


No. 


Snprip^ 




Smith - 

Watcrman 

Score 


% 

Identity 


120 i 


G00S38 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4919. 


404 


50 


1202 


M27826 


Homo sapiens 


neutral protease large subunit 


202 


49 


1203 


Y73424 


Homo sapiens 


Human secreted protein clone yi4 1 protein 
sequence SEQ ID NO:70. 


265 


61 


1204 


AF264014 


Homo sapiens 


scavenger receptor cysteine- rich type 1 protein 
Ml 60 precursor 


625 


98 


1205 


Y36203 


Homo sapiens 


Human secreted protein #75. 


219 


59 


1206 


U78111 


Gallus gallus 


AQ 


205 


57 


1207 


AF09544S 


Homo sapiens 


putative G protein-coupled receptor 


416 


76 


120S 


AF116715 


Homo sapiens 


PR02829 


127 


75 


1209 


AF099137 


Homo sapiens 


MaxiK channel beta 2 subunit 


475 


95 


1210 


AF205718 


Homo sapiens 


hepatocellular carcinoma-related putative tumor 
suppressor 


423 


79 


1211 


Y27868 


Homo sapiens 


Human secreted protein encoded by gene No. 
107. 


224 


70 


1212 


G00719 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4800. 


117 


44 


1213 


GO 1009 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5090. 


351 


73 


1214 


AF090942 


Homo sapiens 


PRO0657 


124 


70 


1215 


Y 14427 


Homo sapiens 


Human secreted protein encoded by gene 1 7 
clone HS IE A 14. 


99 


77 


1216 


G03905 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7986. 


173 


57 


1217 


Y57897 


Hnmn <;anipn«i 


Human trflnimftmhranf nrotpin WTh/fPTvJ-'!? 1 


1 173 


100 


1218 


J00194 


T-Tf*TY"»0 Cflm^n? 


ma ui ai 1 11 ti.v-'i i aiyjxiCL v^nciiit 


454 


78 


1219 


Y59709 


Wrvmn Qani^ni 




470 


Q7 


1220 


W81 576 


Hnmn <;anipn^ 
iiuiiiu jcijjiliij 


F Ft V-inn'iirpn' fi-nrofpin poiintpri rpppntor fFFlT- 

2) polypeptide. 


725 


100 


1221 


W96745 


Homo sapiens 


High affinity immunoglobulin E rcceptor-likc 

nrotpin f'TnF/RFO 

1 W Lv- 111 \ 1 KJ J-^ 1\_1_> y . 


650 


98 


1222 


Y3591 1 


Homo sapiens 


Extended human secreted protein sequence SEQ 
ID NO. 1 60. 


135 


31 


1223 


Y00278 


Homo ^artien^ 


Human ^erreteH nrntein encorieH hv trenft 21 


260 


95 


1224 


AF 16 1422 


Homo ^anipn^ 


HSPC304 

X Xi^JX V r 


568 


90 


1225 


U 14970 


Homo saniens 


rihosomal nrotein S5 


202 


95 


1226 


G01733 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5814. 


610 


100 


1227 


AF099973 


musculus 


crh lntpn'7 

Jvli 1 iXY 


333 


56 


1228 


Q01218 


Homo ^antpn^ 


Human s:crrprpH motpin SFO ID "NJO - ^?Q9 


1 55 


81 


1229 


AF217188 


Mus 

musculus 


YIP IB 


801 


63 


1230 


AF 176813 


Homo ?anipn^ 

X 1 w 1 1 1U oGJJl^llo 


^olnhlr* aHpnvlvl pvpI?*cp 


275 


100 


1231 


X98333 


Homo <?anipns 


oroanic nation tran<ioortpr 


1704 


100 


1232 


W74955 


Homo sapiens 


Human secreted protein encoded by gene 77 
clone HOEAS24. 


212 


53 


1233 


Y94940 


Homo <?ani(*ns 


Human ^orrPtoH nrotpin rlonp vifi'} 1 nrotpin 

sequence SEQ ID NO:86. 


526 


100 1 


1234 


U76618 


Mus 

musculus 


N-RAP 


482 


82 


1235 


AF044924 


Homo sapiens 


hook2 protein 


380 


97 


1236 


G01459 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5540. 


417 


100 


1237 


AF00001 8 


Homo sapiens 


adapter protein 


164 


84 


1238 


W88633 


Homo sapiens 


Secreted protein encoded by gene 1 00 clone 
HE8EU04. 


250 


90 


1239 


W29660 


Homo sapiens 


Homo sapiens CH27_1 clone secreted protein. 


697 


98 


1240 


AF004I61 


Oryctolagus 
cuniculus 


nproYi'^omal Ca-rfpnpnrlpnt Qoliitp parripr 


154 


52 


1241 


Y92710 


Homo sapiens 


Human membrane-associated protein Zsig24. 


709 


97 


1242 


Y95002 


Homo sapiens 


Human secreted protein vc34_l, SEQ ID NO:44. 


908 


88 


1243 


Y44905 


Homo sapiens 


Human potassium channel molecule ERG-LP2 
partial protein. 


325 


100 


1244 


AF284422 


Homo sapiens 


cation-chloride cotransporter-interacting protein 


511 


97 


1245 


Y53629 


Homo sapiens 


A bone marrow secreted protein designated 
BMS115. 


1888 


93 


1246 


AB039371 


Homo sapiens 


mitochondrial ABC transporter 3 


389 


97 


1247 


Y35911 


Homo sapiens 


Extended human secreted protein sequence, SEQ 


168 


39 
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De^cnDtion 


Smith- 


% 


ID 


No. 




Waterman 


Identity 


NO: 








Score 










ID NO. 160. 






1248 


AF072509 


Rattus 


glutamate receptor interacting protein 2 


559 


90 






norvegicus 








1249 


AF^ 47042 


Homo sapiens 


tandem pore domain potassium channel TRAAK 


661 


98 


1250 


B08974 


Homo sapiens 


Human secreted protein sequence encoded by 


1087 


97 








gene 27 SEQIDNO:131. 






1251 


L15313 


Caenorhabditi 


putative 


858 


59 






s elegans 








1252 


Y29338 


Homo sapiens 


Human secreted protein clone it217_2 alternate 


278 


75 








reading frame protein. 






1253 


W01730 


Homo sapiens 


Human G -protein receptor HPRAJ70. 


211 


92 


1254 


G03074 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7155. 


294 


83 


1255 


G01S18 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5899. 


253 


91 


1256 


AF286368 


Homo sapiens 


eppin-1 


222 


54 


1257 


AF220264 


Homo sapiens 


MOST-1 ! 


87 


93 


19SR 

JLZ.~>0 


G02227 




Human secreted protein, SEQ ID NO: 6308. 


281 


78 




Y07970 


Homo Qanipn*; 


Human secreted protein fragment #2 encoded 


81 


94 








from gene 26. 






I960 


R95332 




Tumor necrosis factor receptor 1 death domain 


986 


100 








ligand (clone 3TW). 






1261 


AF 140674 


Homo sapiens 


zinc metal loprotease ADAMTS6 


172 


36 


1262 


U2S369 


Homo sapiens 


semaphorin V 


237 


67 


1263 


Y07049 


Homo sapiens 


Renal cancer associated antigen precursor 


288 


71 








sequence. 






1 iOH 


1 JO 1 _/ J 


W("*n"w\ •j^nipn^ 
nuiii\j ocipitiiij 


Human secreted protein #25. 


187 


80 


J ZOj 


V7R1 1 4 
I /oil 1 * 




Human cytokine signal regulator CKSR-2 SEQ 


723 


93 








ID NO:2. 






19£fi 


Y 13397 




Amino acid sequence of protein PR0334. 


191 


100 


1 — u / 


/AJL UJUJJ O 


Rattus 


phosphatidylinositol 5-phosphate 4-kinase 


859 


95 








gamma 






i/Oo 




Mnmo cars i pri ^ 


candidate tumor suppressor gene LUCA-1 


159 


96 


1 9£Q 


AF ion A Ad 


Mus 


t MRR2 


552 


76 






mi kpi i In q 










at n^n^^ 


Wr\mn QJini^nc 
IT.UII1U oUpiCUo 


dJ570F3.1 (homolog of the rat synaptic ras 


820 


98 








GTPase-activating protein pl35 SynGAP) 








G02126 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6207. 


131 


95 


1 979 




T-T j a m r\ c^ini fn ? 
llOlllU DClJJlC/llD 


N ADH-cvtochrome b5 reductase isoform 


253 


92 




AT n^^i 


nUIllU baJJIClli 


rtT^RPl 1 1 fnfwpl AMP-hindini? en7vme 


1280 


100 








simitar to acetyl-coenzyme A synthethase 












(acetate-coA ligase)) 






1974 


AF06474R 


N4us 


S3-12 


3523 


61 






musculus 








1275 


D17554 


Homo sapiens 


TAXREB107 


377 


7S 


1276 


Y30715 


Homo sapiens 


Amino acid sequence of a human secreted 


643 


90 








protein. 






1277 


AF 1 46760 


Homo sapiens 


septin 2-1 ike cell division control protein 


707 


100 


1278 


Y05069 


Homo sapiens 


Human PIGR-2 protein sequence. 


281 


46 


1279 


X59668 


Oryctolagus 


aorta CNG channel (rACNG) 


267 


85 






cuniculus 








1280 


G01051 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5132. 


489 


98 


1281 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7492. 


120 


43 


19R? 

1 z.oz. 






very large G-protein coupled receptor- 1 


1635 


100 


1283 


AF117814 


Mus 


odd -skipped related 1 protein 


357 


98 






mncru Ins 








19R4 


UO/JlO 




NaDC-2 


535 


60 






laevis 








1285 


AF061346 


Mus 


Edpl protein 


452 


68 






musculus 








1286 


AB030182 


Mus 


contains transmembrane (TM) region 


582 


68 






musculus 








1287 


A13595 


synthetic 


immunosuppresive protein PP15 


185 


97 






construct 








1288 


AF25441 1 


Homo sapiens 


ser/arg-rich pre-mRNA splicing factor SR-A1 


837 


100 


1289 


AF084205 


Rattus 


serine/threonine protein kinase TAOl 


319 


98 






norvegicus 
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9pn 

ocy 
ID 

NO: 


No. 




F) f» cf* ri r\t i r\n 


o nil III - 

w aicriiiaii 

Score 


OA 
SO 

lucniii) 


1290 


AF03S563 


Homo sapiens 


membrane associated guanylate kinase 2 


523 


100 


1291 


AF034837 


Homo sapiens 


double-stranded RNA specific adenosine 
deaminase 


468 


100 


1292 


Ml 5888 


Bos taurus 


endozepine-related protein precursor 


937 


87 


i y j 


A RO 10692 


thaliana 


A'1~P-Hpnf*nfif*nt R7\1A hpHpjiQp-liV'p r\riMpin 


O.iO 


*+ j 


1294 


AF209923 


Homo sapiens 


omhan {"^-nrAtpin pnnnlpd rpppntnr 


1570 


100 


1295 


W67828 


Homo sapiens 


Human secreted protein encoded by gene 22 
clone HFFAF41 


504 


98 


1296 


AC004832 


Homo sapiens 


similar to 45 kDa secretory protein ; similar to 
CAA10644 1 i'PlDff41644l8 > ) 


648 


65 


1297 


X 8003 5 


cuniculu*; 


cv^tpinp riph hair lrpr?*tin a^copiatpH r%rotPtTi 


575 


70 


1298 


G02645 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6726. 


223 


97 


1299 


Y59440 


Homo sapiens 


Human deltas fragment #4. 


122 


32 


1300 


W70504 


Homo ^anien*; 


I ptikropvte ^pven timp<; mPmhranp-npnpiTatirio 

LiLUaul/ j L\_- Jtiwll till 1UJ lll^lJIUlull^ [Jvll^ii OLlll^ 

type receptor protein JEGl 8. 


459 


O 1 


1301 


Y67315 


Homo sapiens 


Human secreted nrotein BI 89 13 amino acid 
sequence. 


3916 


99 


1302 


M77693 


Homo sapiens 


spermidine/spermine Nl -acetyltransferasc 


174 


96 


1303 


G01331 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5412. 


254 


69 


1304 


G01491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5572. 


747 


99 


1305 


AF 148509 


Hntrifl ^ar>ipn<: 


alnlia 1 7-mannoQiHficp 

ulLJIlCl 1 .— 11 tCXi 11 I'J JlUuoC/ 


602 


OR 


1306 


GO 1658 


Mnmn ^ar*ipn^ 


Human <;pcrptpd nrotpin RFO TD NO- S7^Q 


jjj 


QR 

70 


1307 


Y90899 


Homo sapiens 


Dl-like dopamine receptor activity modifying 


332 


98 


1308 


J\Ji V/JJiwV 


Homo car»if*nc 




J'l o 




1309 


Y73388 


Homo sapiens 


HTRM clone 3376404 protein sequence. 


147 


66 


mo 




DUi IdUI Lib 


noobomdi proiein lou 




yu 






R^us 


arbeiiiie liiuuciuie ivinav cibbuLidicu pruicin 


Ooo 


"70 


1312 


Y73342 


Homo sapiens 


HTRM clone 2709055 protein sequence. 


1154 


100 


1313 


YQQAiQ 
i yy-r i ? 


T-T<*ii~r-ii-» Mnifnc 

nuinu oapidib 


Unman PRO17S0 (\ rhlOftd.'^ aminn »rirl 
nuiilail I r\U 1 / Ov yKJV^KiO**^) allllilU clClU 

sequence SEQ ID NO:282. 


1 1 


/o 


1314 


AF1 16667 




PRO 1 777 


"TJJ 


Q"7 


1315 


W75100 


Homo sapiens 


Human secreted protein encoded by gene 44 
cloncIIE8CJ26. 


807 


97 


1316 


AJ272078 


Homo sapiens 


APOBEC-1 stimulating protein 


789 


100 


1317 


AB041533 


Homo sapiens 


sperm antigen 


2607 


98 


IjIo 


Til Q/£1 *7 


Mus 

musculus 


oli-l 




92 


1 "2 1 O 
1 J 17 


Uo2Dyo 


Escherichia 
coli 


ferric enterobactin transport protein 


768 


100 


i T?n 

1 ozu 




Escherichia 


nFHYnRnriFKiA^F eve i t i 
(GLUC1TOL-6- PHOSPHATE 
DEHYDROGENASF^ fKFTOSEPHOSPHATR 
REDUCTASE). 


/i)y 


1UU 


1321 


W67847 


Homo sapiens 


Human secreted protein encoded by gene 41 
clone HPBCJ74. 


601 


92 


1322 


AJ276101 


Homo sapiens 


GPRC5B protein 


466 


93 


1323 


AJ276101 


Homo saniens 


GPRC5B nrotein 


504 


97 


1324 


Y5S628 


Homo sapiens 


Protein re^ulatino oene exnression PRGF-^1 


1584 


100 


1325 


U91561 


Rattus 


pyridoxinc 5'-phosphate oxidase 


1277 


89 


1326 


API 25533 


Hnmn ^anipn^ 


NADH-rvtnrhrnmp rpHnpfncp icnfnrni 


1 606 


i on 


1327 


Y32206 


Homo sapiens 


Human receptor molecule (REC) encoded by 
Incyte clone 2825826. 


1531 


90 


1328 


AF151048 


Homo sapiens 


HSPC214 


657 


85 


1329 


Y10530 


Homo sapiens 


olfactory receptor 


1645 


100 


1330 


AF1806S1 


Homo sapiens 


guanine nucleotide exchange factor 


4314 


99 


1331 


AF1 11856 


Homo sapiens 


sodium dependent phosphate transporter isoform 
NaPi-3b 


3591 


99 


1332 


Y13583 


Homo sapiens 


G-protein coupled receptor 


2171 


100 


1333 


AF078866 


Homo sapiens 


SURF-4 


1395 


100 
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SEQ 
ID 

NO: 
1334 


Accession 

INO. 

Y25755 


Species 
Homo sapiens 


Description 

Human secreted protein encoded from gene 45. 


Smith- 
Waterman 
Score 
1380 
4742 


% 

Identity 

96 
99 


1335 
1336 
1337 

1 Jid 
1339 


AF 152325 

X74070 

AF095927 

\jUjo / / 

AL008582 


Homo sapiens 
Homo sapiens 
Rattus 
norvegicus 
Homo sapiens 
Homo sapiens 


protocadherin gamma A5 
transcription factor B I F3 
protein phosphatase 2C 

Human secreted protein, SEQ ID NO: 795S. 
bK223H9.2 (ortholog of A. thahana F23F1 .8) 


639 
1931 

621 
626 
5820 


81 

95 

100 
100 

99 


1340 
1341 
1342 
1343 


X61615 
Y01519 

U54807 


Homo sapiens 
Homo sapiens 
Homo saniens 
Rattus 
norvegicus 


leukemia inhibitory factor receptor 
A carcinogenesis-inhibiting protein. 

ethanolamine kinase 

GTP-binding protein 


7528 
2372 
1167 


97 

100 

97 


1344 


AC020579 


Arabidopsis 
thai i an a 


putative phosphonbosylformylglycmamidine 
svnthase: 25509-29950 


3283 
944 


51 
100 


1345 
1346 

1347 


Y28576 
W74787 

M55542 


Homo sapiens 
Homo sapiens 

Homo sapiens 


Secreted peptide clone pe503 I. 

Human secreted protein encoded by gene 58 

clone HHFHN61. 

cruanvlate binding protein isoform 1 


1171 

2636 
1329 


100 

87 
100 


1348 
1349 
1350 


API 83428 

U70669 

AF295530 


Homo sapiens 
Homo sapiens 
Homo sapiens 


28.4 kDa protein 
Fas-ligand associated factor 3 
cardiac voltage gated potassium channel 
modulatory subunit 


167 

562 


24 
99 



TABLE 3 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino, acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
■c— phpnviaisminf* G=fi1vcine H = Histidine, 
]=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q-Glutamine, R^Arginine, S=Serine, 
T=Thrconinc 5 V=Vatine ? W=Tryptophan, 
Y=Tyrosine, X=Unknown, ^Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 


1 


1351 


A 


2 


337 


1 


TPSLIHQAPTPCPAGLWG/PPNGHYHUS^UU 
H WPQ APHRA* * * G LLPPRWLGHGLPGGP AAP 
WAASQW VDG VAGRLFGPAWSWHASG AAP A 
OPGPL*LLVPGSSGLPDPRDP 


2 


1352 


A 


27 


100 


366 


1RNSSIRPMKERJETKLSAKHMITCSASYD1KUL. 
QIETTA YHHTPIRMAKIQKT/GHHQC* *ECG AT 
GTLIHGWWGCKVVEPLGKTVWQIPK 


3 


1353 


A 


40 


3 


314 


< HAS AHAS WLKDN SELEQQLGATG AYRARA 
LELEAEVAEMRQMLQLEHPFVNGADKLRPD 
SMYVHLNEL*QSLVENMLLTWDTH\RTPl*R 
SCNYTLALILFL 


4 


1354 


A 


74 


2 


292 1 


TASALFSCPDGGSLAGFAGRRASFHLEtXKK 
QKDRG GDI SQKTVLPLHLVHHQ V AHTFGQ AT 
VTCQQARQSPG* RTNPE/ALQW VLPVSDG WH 

VT PI .P 


5 


1355 


A 


78 


114 


850 


" ENCRVASNLPGVFFSEDTAQSGSYMRISAHPP 
NAGGEVSNGPKRKLTLMLNFSLPSSGLNAGA 
F Y AL STLLNRMVIWITYPGEEVN AGRIGLTIVI 
AGMLG AVIS GI WLDRSKT YKETTL V VYIMDT 
GG A WWCYTFYLGTGDTCG* CF1TAGNTMGFF 
MTGYLPLGFEFAVE1ASYPESEGISSGLLNISA 
QVFGIIFTISQGQIIDNYGTKPGNIFLCVFLTLG 
AALTAFIKADLRRQKANKETLEN 


6 


1356 


A 


81 


97 


376 


EWFSYMLGSNMSVYHSP*SLEPLCKVLSES-A 
YLRVPFIRILLNAR* IRKAYKRMSLEIKLLI/RE 
*CLFQEMGLSLQWLYSARGDFFRATSRL 


7 


1357 


A 


93 


2 


872 


" "TLSSACLIGDAWKEL 11 VAGAVSNQLLV VV YP 
ATALADNKPVAPDRRISGHVGI1FSMSYLESK 
GLLATASEDRSVRIWKGGDLRVPGGRVQNIG 
HCFGHSARVWQVKLLENYLISAGEDCVCLV 
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SEQ ID 

INL/. UI 

nuc!- 
eolide 
seq- 
uence 


SEQ ID 

peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO- 
in 

USSN 
09/496 
914 


Predicted 
ueginning 

nucleotide 
location 
correspondi 
ng to first 
amino acid 

I CjIUUC Ul 

peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 


Amino acid sequence (A^Alanine OCysteine, 
D~Aspartic Acid. E— Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=IIistidinc, 
I=Isoleucine, K-Lysine, L=Leucine. 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine : V=Valine, W=Tryptophan. 

i ~- i yv umiic, /\~ uiuwiiuwn, — oujp luuuii, 

/=possible nucleotide deletion, \=possible 
nucleotide insertion 














WSHEGEILQAFRGHQGRGIRAIAAHERQAWV 
TTGnnn^niRi whi vfiRnYRriT n/ni c;sr t o 

1 1 v_J vj Ls L-J O vJ 1 1 VV n L, V VJIVvJ J rvVJljVJ/ L'LUOLiL^ 

VP** ARYTQGCDSGWLLATAGSD* YRGPVSL 
*RRGOVl GAAARG*TFPVLLPAGGRSWSRGT 
RIVCYGQWGRSCQGCPHQHSNCCCGPDPVS 
WEGAQLELGPAWL 


8 


1358 


A 


106 


3 


350 


FSSLLSGR1STLRDETGAILIDGDPAACAPIIKF 
LLTEELHLRGVSIYVLRHEAQIYG1TPLWCAL 
LI/CRRL*SDSCMRAALNDRGLYQVLILDGLV 
QCLGFVDSDSRKMVSTLT 


Q 


1359 


A 


115 


49 


186 


QAWA1FKGKYKEGDTGGPAVWKTRLRCAJLN 
KSSEFNEGPERERMDV 


10 


1360 


A 


123 


2 


1249 


KGCRTQEKVDRTEVIRTCINPVYSKLFTVDFY 

FEEVQRLRFEVHDISSNHNGLKEADFLGGME 

CTLGQIVSQRKLSKSLLKHGNTAGKSSITVIA 

EELSGNDDYVELAFNARKLDDICDFFSKSDPF 

LEIFRMNDDATQQLVHRTEVVMNNLSPAWK 

brKVb VNbLCbODrURKJ^KCl V WDWDbNUN 

HDFIGEFTSTFKEMRGAMEGKQVQWECINPK 

YKAKKKNYKNSGTVILNLCKJHKMHSFLDYI 

MGGCQIQFTVAIDFTASNGDPRNSCSLHYIHP 

YQPNEYLKALVAVGEICQDYDSDKMFPAFGF 

GARIPPEYTDSHDFAINFNEDNPECAGIQGVV 

EAYQSCFVPKAPTI-TGPTNICPHSSRKVAKFRR 

bJiLjr^^Ml^OK-AJrAllrll^VJJrO^ VuV Y M^UM-Or* 

DNPGGHFV 


11 


1361 


A 


147 


614 


9 


ACARKQLLGRTVFIWFVGQLLGGELKGYSKT 

NTTSSRPASSRGVTLSSSSSSSSSLTKDALPSSL 

KSDSTTITSGLVFPFRSLCVNPAKSSVSESVSSI 

DERVSMGTSSRKPTNSSSSLGALKMSATS\*G 
SGSESPTPFFLTGLQSPPSTRPREPGLTTARNS 
TTLTRDC 


12 


1362 


A 


177 


12 


416 j 


LIPSEPALDSLVDPRVRSRKQPFVIYPVYDTAt 

VLFLDRVYGIEVQDFLLHLLEGGFLPDLRAA 
ASLDT/AEIG AMDFLLS* LFTLCLMMFFFI YPFI 
NLLTMNVY 


13 




A 


i*f y 






wtpwrhi cp a pj i\/f*^r>r^riTr , \/\/QV"VT>r^TvT"r\/o 

V\ J r rltvrlL.or t\xL>i V l^UJ^LJ 1 LV VM Y r viNl Vl^J 

MPDTQMEQGLN/HLFLDGNA*PHSVECYCPS 
TFEIAIKITSFVLYFHRYRAPEVLLRSSVYSSPI 
DVWAVGS1MAELYMLRPLFPGTSEVDEIFKIC 
OV1 GTPKKVSTl VPK1 \ 


14 


1364 


A 


254 


572 


201 


YLLTXIGNLMMLLVINADSCLRTXM*FFLGH 

FFFLDICYSSVTAQDAAEFPVS*KPILVWGYIT 

*SFFFIFSWGTNGCLLSAITYACYAAICHPLLS 


15 


1365 


A 


257 


425 


68 


THAKPLNKKFN1PI<X,V1LPKLVYIVKAIPTKM 

TFTPIET* HP VKQM IKWQ* LTA WLRNRG YKK1 
KQTPNSETAPSVCRNLVFDKCG 


16 


1366 


A 


263 


104 


481 


FCIFRTTEEDRGGDDCVVSVWTKQRNNSCVK 
SKDVFSKPVNIFWALEESVLGVKARQPKPFFA 
AGNTFEMTCKVSSKNIKSPRYSVLIiMAEKPV 
GDLSSPNETKY11SLDQDSWKLENWTDASRV 


17 


1367 


A 


298 


68 


208 


RKRTNNPIKI.DKJCFEHFKNEDI*ITSKHTKJVIW 
VSSLAMKEMLTKTTM 


18 


1368 


A 


300 


904 


1 


LWGITGTRHHARVIFIFLVETGFPHVGQAGL 
ELLTSGDPPAi^ASQSAGITGMSHCARPICGHFG 
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SEQ ID J 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ED 
slO: of 1 
peptide 
seq- 
uence 


Met 

hod 1 


SEQ 1 
[DNO: 1 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 1 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


\mino acid sequence (A=Alanine C=Cystcinc. 
>=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine s G=Glycine, H=Histidine, 
[=Isoleucine, K=Lysine, L^Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown 9 *=Stop codon, 
/-possible nucleotide deletion, \=possible 
nucleotide insertion 














IHLK*MFYTMSQKMP*PTINLILLLIIPGNLNIF 
KPNMGWLGPKTAFV* KDEVLSGIPF AKGRCR 
WK*DY* C/LQEVTDPIMEKGKKKKRTASFFK 
GQPHQSTNALLRRCVR*RYHLS\TVETAGLP* 
KNTGHIPGQPFLFFCL VFK C* N V \ C I * * t k w * 
NIGVKNKSFCPH*SSSPSL*F1GHHSRNF/CSFK 
TEPHSWQAGGQWRNLSSLQAPPPGLMPLSR 
ISLMSSWDYRRPPQ 


19 


1369 


A 


302 


3 


445 


NSPSRWAKIQMFEHTFCG*GCG/ER/NVHlHCjy 
WICRLRPLLWRAVREYLSKLKNAELSFDPGV 
SLLRIYAIDMPTSI* DEKEALLFAFL AFHE*HC 
KSRIW AV1Q/CIHLWD WLRKL* CFHRMKF Y A 
AV*NKPRHLLSHIWKDVQNILLK 


20 


1370 


A 


304 


1 


1339 


FFFCGKEVPLFEQNKHPGPRATTSPGA/HARA 

LLSAGEFTAGVGLSP*AIHSFVWLCTFIQHGA 

GGPCHQPGGSPGPWMHTTQAGHLWEGAYPG 

GSSTWHQVPGQLGGSWGPRERSLLGSFIKCSP 

CPHPPGFRLWMSPNQKPPTENPGVMGRVWR 

LMPGESPLIWEAEGKEDHLSPEGQGHSE/PVA 

PLHSSLGNTVKP*PKNQKPKQNRSRHGQ\GF 

MAGQGQSRPAAR*PPCPALTPASHSAGTWPP 

R]CRTVPGGPCPSPSGFRSCRR*GFSA*TRSWP 

DAEPPSTPDTAPRCCTQSDTSSQGPQ*S*WRR 

CRALPGRLCSAPAAGLRRARPRLSESRRGNSP 

PASPAAASARCPSWGPSCPARPPSRPAAGTEP 

AAPSRCTAWLRGEREPGPRPPGRRPRSGRGP 

VSFAPEVLSLPAVTIQTKSWRWRNEEE1TRPW 

ALVRSRGG 


21 


1371 


A 


326 


799 


1587 


GSQVLPPP 1 PSQDSATLPQDA*GPRAAPGQPVC 
E* GLQG AG VRRLRGEVLCQPQP* G AL*EQCLP 
HLSFSPRQGAAPDTEPSAWGPAPTGATGPGLP 
LRHVRLFSAGAPRGAATPCPPALLHGPAWPP 
ARPMFRGHPPVRPLGPWGKVAAGPRALCLA 
GVPAVQGECATKJPSG* GL*PAHLRGPPGPEVL 
QWHWQLSAGRDPVPAEDPPL* bOrLuruOrA 
AAQAEPGADPEPEDKDQAAESRPAGAMSLSA 

QGSGPVGGQGLR 


22 


1372 


A 


327 


146 


652 


' PHLENPHPEHSFPGAPLT*STLSWSILSPREPSP 
GAPCYPGHPHI.ENPHLEHLLTWRTVTWSTLL 
PGAPCYPEHPHLEHPLTWSTPHLbHrarUiU'L 
SCRTPTRS1LHRDHPLP*CLSTEESPI*GWGSLP 

APPSTPLVLDVAPPGFv^FAob^rOKX'oU iovi 

GTVVSP 


23 


1373 


A 


348 


397 


2 


" CIVSSCQGTRKPCHLEDANKINKQSPTLEKIES 
LQESL*VKQ*LI V AJbK.Y Vi^iL.HrKJs.Kirv ivri - 
NNEKRKMKKRK£EKMCCRERMQRRSKWRR 
EEKKE*RREE\EERKKEKEDRKERRKETSPRG 
SRRLLRD 


24 


1374 


A 


362 


170 


352 


GRALDTAAGSPVQTAHGLPSDALAPLDDSMP 
WEGRTTAQWSLHRKRHLARTLLVSRVRGPQ 


25 


1375 


A 


38'1 


5 i 5 


128 


YLITTILETGYLWKNRHSDQ*KRTENPERDQH 
KYPKVDFCKSN SMKNRLCNKWH WTN WIFTD 
KKINLNLKPHTKLTPNIKKN 


26 


1376 


A 


397 


383 


165 


" EVKNTNPFTFSGTNLTIWIRSI*RKSDEINQRTK 
*MEKYSISLDRRLNTVKMSFLPNLIYKFNTIS1 

KIPANF 


27 


1377 


A 


406 


103 


380 


" KSKATG YMVNI * KLIV\FLY ANDEQLEIEMNK 
IWNFNGSKNKIAFTNLTKYQNIQNRHAENYKI 
LVNKIEDLNKWRNVLLSWIGRRNIINTMT 
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SEQ ID 
NO: or 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO. OI 
peptide 
seq- 
uence 


Met 
nou 


SEQ 

m xt/"v 
ID vs\J. 

in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
ammo aciu 
residue of 
peptide 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
oi pepiiac 
sequence 


Amino acid sequence (A-Alanine C^Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G-Glycine a H=Hislidine, 
I=lsoleucine, lOLysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i — i nreonine, v — valine, w — 1 rypiopnaJi, 
Y=Tyrosinc, X=Unkno\vn, *=Stop codon, 
/^possible nucleotide deletion, V=possible 

JJUCICUCKJC IJIaCI LILm I 


28 


1378 


A 


408 


14 


427 


TIC l'NKFNNLDEIK/FLERHKLSKLTQEE VENL 

ITLKTSRETELVINK*VIPHKEKPGPDSFTGEF 

YOTFIvFPT /TT/TT I-TKT FOTTKYGRTT PN^VYFT 4 ?! 
i v<j i r ivIjIjL/ 11/ ij_<i ixvi_/i^ i us. i olvll^i no v ili ji 

TLKPKPEKDL\KENYRPLPLSNIDAK\LNKTLA 
NRI**HIR 


29 


1379 


A 


434 


395 


128 


1 Y SKMCMEKQRLNN * ILKKNKV RGIAVPDVK 
VYYKPT VIK/TS VVIL * KDSHIVE WNRLENLEID 


30 


1380 


A 


455 


2 


228 


FFFETESHSVTQAGVQWCNPGFKRPSCFGLSS 
SWDYRYAPPRPVANF\*FLVETGFYYVAQAGL 
KJLLSPGDLPALAS 


1 1 


1 3o 1 


A 






2. 


V^Ldvlr Uls.O V JsJN llrlWV uWi rrr 1 xv Y W IsIN W iol 

crrmnlnpylsryikiksrvkdltvrpepiklv 
eentgktiqdtglgk* fiaktskaqstktnk* 
krqtryiklk:\kkstaskennrvkrqple*ek 

IFAN 


32 


1382 


A 


474 


125 


471 


VKPYEJAVFLVKPIEYK*HLLSDPAIPLSGI*LK 
EIKAYT/RRICTPMFAAPVSVIA/RN*KQSK/CQ 
KQ*YVHRMEYYTTIKRSEILICTTTW\T)FRNT 
ILRETDRIHKTTYDVISLI 


33 


1383 


A 


488 


1825 


2 


ksacsficseeqpaspsplkpgtyase'iarprdp 

haagprlsdsseaetrrprga/dgsgtvvkgt 

pgspappcswghggvetegag*cpaapgtdlr 

apggsags*\glpsaggsrgri<:gwraagrqp 

str*grpgrhggrge*aghpeprqsalqsag 

l/asspepmgaalaedgsgdsrg agprpqe* P 

PSVLSRS\GS*G*G*AASGTASSPRSHSSRLGPP 

SAGFHGLRCGQPPFAAAPFGPWPGTGRPAGG 

AGSPPAAAGTAPPATRGAQSRRQNRTAGRNA 

SPQTAAGAGSPVQWALSRATG*TGETGSWC 

AGGTHQATHLTAAWVCPPTWSVRPGGSGPA 

ALrLGR* (jRHr AQbrPLr V PRO* PA W PQbAPSP 

SPASSEVALSSGSCWPDQAPGPARGSPPAPLA 

PAW P AAGRGRQR* GRQS AHPPPRR* STAVSL 

SGTS*WRRSP*AGTRTQQC*SPWLVPACSSRP 

L*RGTRRPSTQQSPQTTGTPGRSAGPGHPRS* 

OLjKoJt AO 1 OriJLOAv,) A V Aorri^Ori Wr I ALbOL 

WASASPPGPEAPPQTGACIGTNCRYRAASAR 

RSSVAPACA*GWQ*AGSPPAVLRGPP*RVRER 

GALTHRPRAPDE 




1 JO** 


A 


4.07 


/IT) 




A V>d A QVfVD AHA A PH*P Prr.PTPrP PPC A T n\/C/F. 

AGRAGRAGEGRPVPPAYPLCKSAQTSGPPKA 
RLS\PPLASCGGRGPPGGAACATCAPPAGPAR 
SSRCRRRSPPE*GPR*PSRPARPSPGSAASRRQ 
KLTPCRCQFRGLCA 


35 


1385 


A 


509 


156 


475 


PTPYPGE* QAAFLLRGPGLRPPA/DPSLR/HRN 
LTELWAVTDENIVGLFAALLAERRVLLTAS 
KLSTLTS CDHAFCAJLL YPMR^^EH VLIPTLPPH 
LLDYC*CPPLPRT 


36 


1386 


A 


512 


3 


1631 


FFFSFVCHLYCVSPTPGPHGRLATWL/PGLLA 

PI Or A Ar.r.OTl rPAnPI Pr,HAR AHAQriAD^C 

VLIAVPGRRRVIITCGPGPAAPSTRGECPPPAL 

GHTRPARPRPV\PFAPAVPQEPGGQGHGAA/P 

PATGHSAPRGCPPARAAPTGSATPAPPPAACA 

AFHSAWSVPPAGRQQG*RVPAPAFRRTTPGT 

PGQHLLDRPGAPPAQGSGPAPAPPPRLAGPA 

GPAAPPPGPPAASWHSSLSKSSSSIAGWSPPLP 

VGPGSLQ+TPPPQGPIILSGSCGGTSSWRGQR 

AAVARRLRSWNACGLSRVAGRSSASYPGRE 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
MO: of 
peptide 
seq- 
uence 


Met 
bod 


SEQ 
[D NO: 
in 

USSN 
09/496 
914 


Predicted ) 
jeginning i 
nucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
lucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^ Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine s G=Glycine, H=Histidine, 
l=Isoleucine, K>Lysine.. L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine. X=Unknown. *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














GRPSQSQ*PAGPPGMRGCCLRGW*PSSSGSD 
GPGPHPASTWLRAGKTGPSPPACGCA*LPPPS 
VSAAPQSPRTRCPRGCAAAAGLCVLAAAGAS 
HG A\GLPG VRVHTQRVHIH* GAG/GCQTPRPR 
LRSLPVLGLPAPRCPVSAHPWHRRSGSSCHA 
ARLVPRHPAPGCP* *TG*\PLITGFPEP* A*GLP 
NHQAVGLEASGALQAGHRDELPTMVQLLDH 
SPDYPLKGRPHAP 


37 


1387 


A 


620 


828 


1 


FRLPLAAGA/RGAAEPRVAVbMArDt'bAivll-i 

WEASPEMQSKCHQKGKNNQTECFNHVRFLQ 

RLNSTHLYACGTHAFQPLCAAIDAEAFTLPTS 

FEEGKEKCPYDPARGFTGLIIDGGLYTATRYE 

FRSIPDIRRSRHPHSLRTEETPMHWLNG*EDE 

AQDDGG*GTISSFLLPWPAJL>llr 1 rKbf ui^r vn 

SIPVCCQVRGQPQSGGKESPACLKSLSNCLTH 

VDAEFVFSVLVRESKASAVGDDDKVYYFFTE 

RATEKESG SFTQ SRSSHRV ARGIPPL 


38 


1388 


A 


739 


1 


427 


FRAMVSSTLKLG1SILNGGNAEVQ/QGNRGKG 
TSEEGKEG*EVPV*LPVSPPLPRPLQKMLDYL 
KDKKEVGFFQSIQALMQTC\GEKVMADDEFT 
QDLFRFLQLLCEGHNNDFQNYLRTQTGNTTT 
IN IIICT VD YLLRLQES I 


39 


1389 


A 


767 


I 


1030 


TLDLTGPLLLGGVPNVPKDFRGRNRQFGGCM 

RNLSVDGKNVDMAGFIANNGTREGCAARRN 

FCDGRRRQNGGTCVNRWNMYLCECPLRFGG 

KNCEQGEWPASSIPPVTAAWEALLLDVPG1 1 

VRGLHIQVRQPLYVYAAFTVDSHRPLQETVL 

RRAPAPASGVPSPSGVGWDR*AGPAEPSPSTP 

ATV IIS VPW YLGLMFRTRVKEDS VLMEATSGG 

PTSFRLQVTGAPCHQGTC*VGAKOKJJrMLau 

LRVTDGEWHHLLIELKNVKEDSEMKHLVTM 

ti nvr\/nnvc:\x^rT HT 1 Wfi*TT PPAOGKTGA 

SEDKVSVRRGFRGCMQVRGGCGGRGEACPS 

QAAPRL 


40 


1390 


A 


801 


69 


"399 


" """iHKinHKEDLNKWKYlLCSGMERLSTVMlPVV 
PQIIYKFNA* QWILKFT W*E*G AKITILRKNKL 
RGLVLVPLS TC* VKYLJ-lJJvVL.rrinvi i ir,ruv 
VNKSVVLVQVTIM 


41 


1391 


A 


835 


7 


195 


" SMLKERKVFQFPSCLFFQYITWLGPPYHVLFD 
SSV rNFSIGAlv* LULi^b viviml^l i /usjxir v i 


42 


1392 


A 


841 


1 


415 


" ~ G STHASG YDKTPDFILQVPVA\ r EGHIIH W1ES 
KASFGDECSHHAYLHDQFWSYWNSLKHRTW 
QGIGTVASNLSQL*TLKAPFPELLLFRSLARTG 
FVLT*\RFGPGLVIYWYGFIQELDCNRERGILL 

KACFPTNIVTL 


43 


1393 


A 


845 


358 


92 


PALSPAPVPQKKGSPLPLDPCLGPSSWLLSVG 
LGWPRL*PRRGPGDPGSLPATPPLLTPPHTLLP 
QRPMLPPSH AGLARPrPrbr lo V 1 


44 


1394 


A 


853 


452 


1 


LPQYCFFPRLSPKSKLVKHSAL**PSALKPPTK 

SPRCIPRTSLYFTICOPPALQL/SPIEDPPAIYRS 

PPTHMLRSASQPLNQAPTLVKGHPPSRFLQG 

QVSCPPQPTLPREKPLPLHLRPPPRPAQPPLPR 

PLTFSTRRNVDPEIPERFR 


45 


1395 


A 


894 


379 


162 


■ GV YPPTVFDN YS VQTSVDGQIVSLNTWDTAG 
QEEYD/RLRTLS*PQTS1FV1CFSIGNLEFPIYGT 
WLSMSMGK 


46 


1396 


A 


900 


1 


366 


' TTKKTL1SNNVSSRSLP1LPELKAFSLAFNDPL 
EIQKYMRT/DQ* CVTHDISL YIVTKLALIFLIPR 
WLFHQLNrr**Cl.HFFTMTTFIAIPFSFLFLGR 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine. 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine ; P=ProIine, 
Q=Glutamine ? K=Arginine ? S=Serine, 
T=Threonine, V=Valine } W=Tryptophan, 
Y=Tyrosinc, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














D/KSLAMLPRLVSNSWPQVILPP 


47 


1397 


A 


944 


162 


2 


QLQNLASRGCL*SQLLRRLRRENRLNPGGGG 
CSEIAPVCTPAWVTQRDFFRKKK 


48 


1398 


A 


963 


216 


308 


HFTPDR1AIVKNTRDSHCWRGC*EEGAPARC 


49 


1399 


A 


967 


466 


1 


PRKRES WWGERLP/PRGFPPAAED APAPG WK 
GRKHASRTARAHVFHPIRQSIRSPVRGRPGDP 
RAAHTRSAGTRLQCKASRGG*GKGPAPTR*E 
GGPGSAPAPLPASSGCSLFPDSSPWTPPPPAPG 
AAAAQP* *TPRCPAALRAG AHIGR VGRPY 


50 


MOO 


A 


973 


45 


421 


EICCIQALD VF VFC YIDHSSHCLMSCD* E/DQ A 

LNFMPLEMEPKMSKLAFGCQRSSTSDDDSGC 

ALEEYAWVPPGLRPEQIQLYFACLPEEKVPY 

VNSPGEKHRJKQLLYQLPPHDNEVRYCQSLSE 

E 


51 


1401 


A 


992 


2095 


194 


IRIRHEAARSCLGCAAGHVPAPGLRLLPTVRG 

PPGRRGPAAPGCVCY*SGESTFVSHVPQRMA 

WPGSAPPRGFHPLQSQTSPSDTVSSPQLSKEE 

DGPGWEHPLSSSL*SLGQAGGNH*QPEELAG 

WEPRGPPSLAPSSPT/TMWTALVLIWIFSLSLS 

ESHAASNDPRNFVPNKMWKGLVKRNASVET 

VDNKTS ED VTMAAASPVTLTK GTS AAHLNS 

MEVTTEDTSRTDVSEPATSGVAADGVTS1APT 

AVASSTTAASITTAASSMTVASSAPTTAASST 

TVASIAPTTAASSMTAASSTPMTLALPAPTST 

STGRTPSTTATGHPSLSTALAQVPKSSALPRT 

ATLATLATRAQTVATTANTSSPMSTRPSPSKH 

MPSDTAASPVPPMRPOAOGPISOVSVDOPVV 

NTTNKSTPMPSNTTPEPAPTPTVVTTTKAQAR 

EPTASPVPVPHTSPIPEMEAMSPTTQPSPMPYT 

QRAAGPGTSQAPEQVETEATPGTDSTGPTPRS 

SGGTKMPATDSCQPSTQGQYMV/DHI I* APHP 

GRGRQNSPSGGAVTRGDPFHHSLGFVCPAGL 

* ELQEEGLHPGGLLNQRD VCGLRNVRG AG A 

WREAWPLPRPFLLPLRPNQVLPNSFGAIEEIC 

QMLKHI 


52 


1402 


A 


994 


1 


462 


ESGEFLVSFTLKJKPTNVFHHINGMKFFNK/LIF 
*SHTDIAFYKJQHPFMLKALTKWA*EGT*PDR 
RYLH* SLRLNGEQLKTFPLRSGMR*G/CAILPL 
VLNAMLSIVPAVVPAGKTRHEKETTCPETGQE 
EK*FS*FVGDMNTCVENKJKESKJCLLE 


53 


1403 


A 


1011 


1 


630 


PEVIQQSAYDSKADIWSLGITAIELAKGEPPNS 

D^PMRVLFLIPKNNPPTI-ICWRRELESFKEV 

*LlyO,A*TKDPSI\RPTAKELLICHKFIVKNSKKT 

SYLTEL1DRFKRWKAEGHSDDESDSEGSDSES 

TSRENN'J HPE W SFTT VRKKPDPKK VQN G AEQ 

DLVQTLSCLSMIITPAFAELKQQDENNASRNQ 

AIEELEKSIAVAEAAGPG 


54 


1404 


A 


1016 


I 


222 


ISIDA*KAFDKIQH/CFMITTLKKLGIDGKYTN 

TIKAIDDRHTVSTILNVEKLKAFL*RSGTRQRF 

PISGSGARI 


55 


1405 


A 


1033 


3 


366 


HASVDGDEGSDDVYYYYTPAILRELQALNTA 
EAAEHRPEEDRMLSEDPWRPAHMIKGYMPL 
HNIPHTEVIDVTGLNQSHLYQHLNKGTPMKT 
QKRAAVLYTWHVLEQLEILRQINQQSHGPG 


56 


1406 


A 


1044 


5 


429 


SVLTLQTRSPSICPLSVRICLMDWEVVSRNSISE 

DRLETQSRASRSPPVTPNQSQETPVDGKPLAL 

PPNQSQKNIRYHIHYLHLQYYLDRHISATLPIP 

SSSGIPTPIAVITDALTDLVELILGQPCSEESGR 

APGTLFLLAL 
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SEQ ID J 
NO: of 1 
nucl- ] 
eotide 
seq- 
uence 


SEQ ID I 
slO: of 1 
peptide 
seq- 
uence 


Viet 5 
lod 1 

i 


SEQ I 
DNO: 1 
n i 
USSN 
39/496 
914 


Predicted 1 
beginning i 
luclcotide 
ocation 
correspond! 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end * 
iucl eotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


\mino acid sequence (A=Alanme OCysteme, 

pi A crtortir An\A P=f~rlllta.niiC Acid, 

Phenylalanine, G=Glycine, H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucinc 3 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T _-pi_ rpon ; ne v=V aline W^Tryptophan, 
Y^Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


57 


1407 


A 


1050 


11 


430 


GAYAFETNGFPIMLVLTTDKIEGDVGIAGLYD 
MHXISLPMAFLLRTLVRCTSYIIPVTHVLSTPV 
TCLRRREKDGVIVDVLSDTASNHN GFPVEEH 
ADDTHPARLOGPTLRSQPMGPLKHKAFEERA 
NLGLVQRRLRLED 


58 


1408 


A 


1058 


258 


419 


LBCHRDTPWGANNRALSCTPLTSLTLCALCf L 
PPT GCPTXATCRLYOTTVAVVF 


59 


1409 


A 


1064 


3 


425 


KAFSFTTSLIGHQRMHTGERPYKCKECCjK 1 h 
KGSSSLNNHQR1HTGEKPYKCNECGRAFSQC 
SSLIQHHRIHTGEKPYECTQCGKAFTS1SRLSR 
HHRIHTGEKPFHCNECGKVFSYHSALIIHQRIH 
TfiEKP YACKDVGK 


60 


1410 


A 


1065 


204 


419 


" GGPPGPFL AHTHAGLQAPGPLL AP AviDEUUL 
LLLAVQQSCLADHLLTASWGGK/DPIPTKALG 
EGQEGLPLTV 


61 


1411 


A 


1079 


3 


383 


RHSRAHLCQPFHLVMRDLLQLGQDIFQUCH Y 
LEENHLIHRDIAARNCLLSCAAPTRAATIGDF 
GMARYIYRTRYYQLGDRAL/LPRKWMPPEAL 
LEGIFTYNTDSWTFGVLLWEIFSLGYMPYPGR 

TN 


62 


1412 


A 


1080 


1 


859 


VVEFLWSRRPSGSSDPRPRRPASKCQMMbEK 

Aivn \/ru\/t\/Tit r i <sTirVT T O^AT SLGRSLDADHA 

PLQQFFWMEHCLKHGLKVKKSFIGQNKSFF 

GPLELVEKLCPEASDIATSVRNLPELKTAVGR 

GRAWLYLALMQKKLADYLKVLIDNKHLLSE 

FYEPEALMMEEEGMV1VGLLVGLNVLDANIA 

CLKGEDLDSQVGVIDFSLYLKDVQDLDGGKE 

HERITDVLDQKNYVEELNRHLSCTVGDLQTK 

mm FfcTTNTCia OFRVSAATDRICSLOEEQQQL 

RFOMFLTR 


63 


1413 


A 


1083 


2 


615 


SSFAKHKR1HTGEKPF1CLECGKAF fSSri L i K 

HRRIHTGEKPYTCEECGKAFRQSAILYVHRRI 

HTGEKPYTCGECGKTFRQSANLYAHKKIHTG 

EKPYTCGDCGKTFRQSANLYAHKKJHTGXEKP 

YKCKECGKAFKSYYSILKHKRTHTRGMSYEG 

DEC/QRSLN/RSSILSNHKIIHNEEK/PLKCEKCE 

KAFNHTSICCRHKKN 


64 


1414 


A 


1084 


946 


1 


' " KKQDL S S SLTDD SKN AQ APL ALTESHL A I LA 
c;q«?o^PF ATKOl LDSGLPSLLVRSLASFCFSHIS 
SSESIAQSIDISQDKLRRHHVPQQCNKMPITAD 
LVAPILRFLTEVGNSH1MKDWLGGSEVNPLW 
TAT LFLLCHSGSTSGS\HNLG\AQQDQCKISFS 
FFSWLTTGLTTQQRTAIE\NATVAFF\LQCI\SC 
HPNNQKLMAQVLCELFQTSPQRGNLPTSGNI 
SNGFIRVRLFLQLMLEDEKVTMFLQSPCPLYKG 
RINATSH\^QHP\MYGAGHKFRTLHLPVSTTL 
SDVLDRVSDTPS1TAKLISKQKDDKKKK 


65 


1415 


A 


1087 


103 


324 


' PR APFFVHTEMIVG/RVONIHLFTLQVLEDRA 
LFTMSVG SSL WSTYLIHVMALP/DRELLKPN A 
SVALHKLSNALV 


"" 

oo 


1 A1 fs 

IH- lO 




1095 


3 


493 


" HETCSVTHIVSFSLPFLNPSHPASTPGHTENEQ 
PSLVWFDRGKFYLTFEGSSRGPSPLTMGAQD 
TLPV AAAFTETVNA YFKG ADPSKCI VK1TGE 
MVLSFPAGITRHFANNPSPAALTFRVINFSRLE 
HVLPNPQLLCCDNTQNDANTKVEFWVNMPNL 
MTHI.K 


67 


1417 


A 


1098 


57 


356 


' LKLTSLGF1IGVSVVGNLLISILLVKDKTLHRA 
PYYFLLDLCCSDILRSA1CFPFVFNSVKNGST 
WTYGTLTCKVIAFLGVLSCFHTAFMLFCISVT 
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SEQ ID 
NO: ot 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
MU: 01 
peptide 
seq- 
uence 


Met 

i, „ _i 
noo 


SEQ 
1JJ iNU. 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alaninc C=Cystcine, 
D=Aspanic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
l=Iso leucine, K=Lysine, L-Leucine, 
M=JVlethionine, N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














RYL 


68 


1418 


A 


1106 


1 


1326 


MGKISATGINMGTKCSWALVWIILESYDPKH 

YEREGMQDWKTASGQSEEATQQSSQKPQPH 

YTTYQSSSFLKYSSESHLLAWRENSSEGSFQF 

PGRSRARPPRTRQQRRGAAAGPGRGAVRLG 

HPQSAAQPQLRAAARIPESPAAFPAQPRPGSA 

RNSDASGPASLSRTLGRASSPRPPQAPDVTAP 

O T*» A AT A TVT^ 4 A Y*h /""» 171 T"» A A A T A /*> A f""> A 1~> TNT T"1 ~r*i 

SPAALAPRAARGGSRAAALA GAEAEEPLRTL 

APRPTRAAAPPPPPPPPPLPPG APPPP VRC V SR 

RARAPPWR/PAATGPPPXRPVAPSRKI.GSARAP 

APALQIRKGTSSGLPGRGGGSGPGNNLSSVA 

GNWRGSSFAVERPGMAKYQGEVQSLKLDDD 

bVltGVbDQVLVAVVVSrALlATLVYAJLFRNV 

HQNIHPENQELVRVLREQLQTEQDAPAATRQ 

QFYTDMYCPICLHQASFPVETNCGHLFCGSLT 

PNSIW 


69 


1419 


A 


1107 


2 


466 


FDTARLHEFGTSITQIFAVDNREDLQKWMEA 

FWQHFFDLSQWKHCCEELMKIEIMSPRKPPLF 

LTKEATSVYHDMSIDSPMKLESLTDIIQICKIEE 

TNGQFLIGQREESLP/SS/CGPHSLMVTIKWSS 

RKRY/SYPASEPLHDEKGkkRQAPLPPSDK 


70 


1420 


A 


1111 


698 


23 


ALRRLHYVRA r rKV\FLSFlU<PFWREEH[EGGH 

SNTDRPSRMIFYPPPREGALLLASYTWSDAAA 

AFAGLSREE.ALRL ALDD VAALHGP WRQL W 

DGTGVVKRWAEDQHSQGGFVVQPPALWQT 

EKDDWTVPYGRIYFAGEHTAYPHGWVETAV 

KSALRAAIKINSRKGPASDTASPEGHASDMEG 

QGHVHGVASSPSHDLAKEEGSHPPVQGQLSL 

QNTTHTRTSH 


71 


1421 


A 


1119 


2 


385 


QKQTLQNGYLDSSMDILYLGSLPPELQVSSDE 
PPGPPEQAGLSQFHLEPETQNPETTEEIQSSXLQ 
QEAAAQLPQLPEVVELSSTKAAEAPALPSQSL 
EGVHSSTEQKAPAQQLPAFEEILAPLLIHHE 


72 


1422 


A 


1127 


1 


906 


HAQYVGPYRLEKTLGKGQTGLVKLGVHCIT 

GQKVAIKIVNREKLSESVLMKVERE1AILVRLI 

EHPH\>TLKLHGVYENKKYFPPDELTSGPSMLA 

QVSPHGKLSARRSWDLLSGFPRYLVLEHVSG 

GELFDYLVKKGRI.TPKEARKFFRQTVSALDFC 

HSYSICHRDLKPENLLLDEKNNIRIADFGMAS 

LQVGDSLLETSCGSPHYACPEVIKGEKYDGR 

RADMWSCGVILFALLVGALPFDDDNLRQLLE 

KVKRG VFI IMP! IFIPPDCQSLLRGMIEVEPEKR 

LSLEQIQKHPWYLGGNFIS 


11, 
/ j 


1 All 


A 

A. 




1 


802 


LKJN ALDVLHRE VPRVL VNL VDr LNPT1MRQ V 

FLGNPDKCPVQQA/MLEPLGSKTETLDLRAE 

MPITCPTQNEPFLRTPRNSNYTYPIKPAIENWG 

SDFLCTEWKASNSVPTSVHQLRPADIKVVAA 

LGDSLTTAVGARPNNSSDLPTSWRGLSWSIG 

GDGNLETH 1 TLPNILKJvF NPYLLGFSTSTWEG 

TAGLNV AAEG ARARDMPAQ A W DL VERMKN 

HLATEYVQHIQQALDILSE 


74 


1424 


A 


1139 


60 


480 


FREPCLLVPGDHQPLREASWLA/LPPIGLWGT 

DSPLCCVEVAIPCNKGAHSVGLKGWLLAQG 

VLGMRDTIPQEHPWESTPDLCFCRDPEEIEVE 

EQPAADAAVAKGEF/QGEQIAPVPAUIAAHPE 

AADPAPVHTTAHPKGA 


75 


1425 


A 


1147 


2 


413 


PFPHQHPQEP\KGSCWPQSALRGQCPGPVLGV 
TTTSDLCSLQVTVSSHRNPLLblTAAYDQEGR 
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SEQ ID 1 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID ] 
SJO: of 
Deptide 
seq- 
uence 


Viet 

hod j 


SEQ 1 

[DNO: 

n 

USSN 
09/496 
914 


Predicted 

oeginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Ajnino acid sequence (A=Alamne C-Cysteine, 

Ly^ASpartlC ACIQ, C.— WlUlcU'llL, /yuiu, 

^Phenylalanine, G=Glycine, H-Histidine, 
I=Isoleucine, K=Lysine ; L^Leucine, 

xyi A/fc»»Viir\ni-nf> "M=: A enar^crinp P— Proline. 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine 3 X=Unknown, *-Stop codon, 
/^possible nucleotide deletion, \=possible ! 

nilCICULIUC llloCl llvll 














RFDNFSSLSIQWESTRPVLAS1EPELPMQLVSQ 
nnp coOKKI HGLO AIL VHEASGTTAIT AT AT 
GYQESHLSSAR 


76 


1426 


A 


1155 


38 


410 


PI IS AP AQDDPILLSFIHCLH ANLLCV WRRD V K 
PDCKEIWIFWWGDEPNLVXVQYIMNCMLWK 
KDSGKIV1AFPMNVGRC/FFKEIHNLLERCLMD 
KNFVLIGKWFVRPYYKBEKPVNKSEHLSCAF 

T 


77 | 


1427 


A 


1162 


526 


350 


RFPQGLEDVSTYPVLIEELLSRGWSEEELQUV 
LRGNLLRVFRQVEKVQEENKWQ SPEED 


7S 


1428 


A 


1171 


I 


1293 


MAESASPPSSSAAAPAAEPGVTTEQPGPRSFF 

SSPPGLEEPLDGADPHVPHPDLAPIAFFCLRQT 

TSPRNWCIKMVCNPWFECVSMLVILLNCVTL 

GMYQPCDDMDCLSDRCKILQVFDDFIFIFFA 

MEMVLKMVALGIFGIOCCVLGDTWNRLDFF1 

VMAGM VEYSLDLQNlJNLoAlKl VKvi^i^L-is-rv 

INRVPSMRILVNLLLDTLPMLGNVLLLCFFVF 

FIFG1IGVQLWAGLLRNRCFLEENFTIQGDVAL 

PPXYYQPEEDDEMPFICSLSGDNGIMGCHEIPP 

LKEQGRECCLSKDDVYDFGAERQDLNASGL 

AWIVIFQVITLEGWVEIMYYVMDAHSFYNFI 
ven i nvQvpppni r no^FSTAOSPKCOGDSFP 
GVAAESLLLRGWVLWLPGGG 


79 


1429 


A 


1175 


1 


405 1 


PNDFFKDMFPDLPGGPLGPIKAENDYGAYLN 
FLSATHLGGLFPPWPLVEERKLKPKASQQCPI 
r^uv\n\An a r^v: t prt-t\ARTHTGFKPYMCTICE 
VRFTRQDKLKIHMRKHTGERPYLCIHCNAKF 

VHNYDLiCNHMR 


80 


1430 


A 


1182 


25 


198 


EMN ELSQQLSQQGGRGASQCPSPP aptlpnpt 
PLCQLQLQRVNTGLPTPPCHPGAGAA 


81 


1431 


A 


1186 


254 


5S3 


ktvldvgagtgilsifcaqagarrvyaveas 

AIWQQAREWRFNGLEDRVHVLPGPVETVEL 
PEQVDAIVSEWMGYGLLHESMLSSVLHARTK 


82 


1432 


A 


1187 


2 


716 


DFVDAARNLPLESTKSPAEPSKSVPSLENDPRA 

SSQGLPSQGPVQNQGRRGEQRPKKF/TVIQHT 

SSFEKSDSLEQPSGLEGEDKPLAQFPSPPPAPH 

rnCA nn, ot>Tri VPOPMiOVPFlI VTEEPDRPD 

TEPEPPPKEPEKTEEFQWPQGSQTLAQFPVEK 

LPPKKJCRLGLAKMAQSSGESSFESSVPLFRSP 

SQESNVSLSGSSRSALFERDDHGKLAJBAPSPSF 

DMGPKPLGTHMLTV 


83 


1433 


A 


1188 


517 


804 


" rcpri ovvi PTPrAFAYPFl FDNLPLF YRLGLC 

WGRGHGCGQEALSTSHGYHLFCALLTGFLFA 
SHLPERLAPGRFDY1GHSHQLFHICAVLGTHF 

Q 


84 


1434 


A 


1192 


45 


476 


t ^rn/rc\u\/PDTP\/HF A AOROFST X /IX)ULltJS 

L(jD\ Or W V t,xv 1 r v jtjzi/aj^v^^^^^V^VV^^-^ 0 

GACVNQVTVDSITPLHAASLQGQARCVQLLL 
AAGAQVDARNIDGSTPLCECLRLGQHRVCEA 
LAVLRGQGQPSPVHSVPPARGLHXREFRMC* 
GFLFDVGXNLEAHEFHFGEP 


85 


1435 


A 


1194 


69 


410 


™ KRSEEAS APPFPLGGTGAAPTRASLPEQILLPR 
SCLEARKSQPDEKLLSALHNSRTWN*EPRRSQ 
HRLVSPEVHPGRRGSSPGVAECKLTSAYFRT 
GRSPCPSLPGTTRTNSLL 


86 


1436 


A 


1215 


3 


405 

1 


~ LPSHTCGNPGRLPNGIQQGSTFNLGDKVRYSC 
NLGFFLEGHAVLTCHAGSENSATWDFPLPSC 
RAJDD ACGGTLRG/AJEWHHLQPPLPLG/ ATKN 
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SEQ ID 
nucl- 

pnt i fit-' 

seq- 
uence 


SEQ ID 

1N\J. OI 

peptide 
seq- 
uence 


Met 

hod 


SEQ 

ir> no- 
il/ INU. 

in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 

lU^.aUUl i 

correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponai 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AJanine OCysteine, 
D=AspQrtic Acid.. E=Glutamic Acid. 
F=Phenylalanine, G^GIycine, H^Histidine, 
i— iso leucine., iv — L»ysine, l,— Leucine, 
M=Methionine, N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine 3 S=Serine s 
T=Threonine, V=Valine. W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop cod on, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














NADCTWTILAELGDT1ALVF1DFQLEDGYDFL 
FVTPiTPri < s , ?l w 

C/V J U 1 DvJjoL w 


87 


1437 


A 


1216 


226 


964 


GTARFGPMVGFGANRRAGRLPSLVLGVLLV 
VIVXa.AFNYWSISSRHVLLQEEVAELQGQVQ 
RTEVARGRLEKRNSDLFA V VGHA QETDRPEG 
GRLRPPQQPAAGQRGPREEMXEDDKVKLQNN 
ISYQMADIHHLKEQLAELRQEFLRQEDQLQD 
i i"\Jcv:n1N 1 i L, VJSJvUfc, Y toryLuyyMlsx/Ll^yn 
EENIKKLADQFLEEQKQETQKIQSNDGKELDI 
NNQVVPKNIPKVAENVADKNEEPSSNHIPHG 


8S 


1438 


A 


1218 


1 


534 


PEFGTTISCGYLMATDVSRRPSVHKAVE1EQE 
RVKSAGAWIIHPYSDFRFYWDLIMLLLMVGN 
LIVLPVGlTFFKEENSP\PWrV'FNVLSDTFFLLD 
LVLNFRTG1WEEGAEILLAPRAIRTRYLRTW 
FL VDLI S S IP VD YIFL V VELEPRLD AE VYKTAR 

AT Xf i\f"D ~C"W n ct r or 


89 


1439 


A 


1223 


1 


743 


MGFDEVFMINLRRRQDRRERMLRALQAQEIE 
CRLVEAVDGKVGMLTRSNAAPGRHLAMLET 
LVVVAPRFVDADNULNPDTLSLLIAENKTVV 
ArMLUbKAAYsNl- WCGMTSQG Y YKRTPAYI 
PIRKRDRRGCFAVPMVHSTFLIDLRKAASRNL 
\AFYPPHPDYTWSFDDIJVFAFSCKQ\AEVQMY 
VCNKEE YGFLP VPLRAH STLQDE AESFMH V Q 
LEVMVPSSPSSAQSMAVVSADH1GLVISYL 


90 


1440 


A 


1227 


2 


349 


NKTSF1FYLKNIVVADUMTLTFPFRIVHDAGF 
GPWDFKFILCRYTSVLFYANMDTSIVVLGLIT/ 
YDRY/WKVVRHL/WDSWMTGI/SFTRVYLI.G 
LGARLVWFGKXILAKGGHGGISWL 


91 


1441 


A 


1245 


3 


1937 


LGSSDVRAPQRSELGAESPSRMVASQAYNLT 

SALTPILTRSRVLNEEPLTLAGF\SRAPANLSD 

WQLIFLVDSNPFPFGYISNYTVSTKVASMAF 

QTQAGAQIPIERLASERAITVKVPNNSDWAAR 

GHRSSANSVWQPQAFVGAVVTLDSSNPAAV 

LHLQLNYTLLDGRYLSEEPEPYLAVYLHSEPR 

PNEHNCSASRRIRPESLQGADHRPYTFFISPGT 

RDPVGSYRLNLSSHFRWSALEVSVGLYTSLC 

QYFSEEDVVWRTEGLLPLEETSPRQAVCLTR 

HLTAFGTSLFVPPSHIRFVFPEPTADVNYIVML 

TCAVCLVTYNWMAAILHKLDQLDASRGRAIP 

FCGQRGRPKYEILVKTGWGRGSGTTAHVG1M 

L r O V D z> Kb G HRF1L DG D RAFHRN SL DIFQ I ATP 

HSLGSMWKIRVWHDNKGLSPAWFLQHIIVRD 

LQTARSTFFL VND WLS VETEANG GL VEKE VL 

AASKASFRVPTPSXAALLRFRRLLVAELQRGF 

FDK.HIWLSIWDRPPRSCFTR1QRATCCVLLICL 

FLGAMAVWYGAVGDSAYSTGRVSRLNPLSV 

Dl VAVULVooV V V Yr V i LAlLrlJrKJVloKSKV 

GWGWGPGSTGNGAWASAPCPEPPLSSAAAR 
GKGVHQRLLGKGQHT 


92 


1442 


A 


1246 


5 


562 


VFDEENILNELNDPI..REEIVNFNCRKXVATMP 

1 KANAnPNPV'PAMF 'skTI Vi KI^WDPHriYllRPri 
i-r i /A In r\^LJ r i\ r v i /AJ VL I. j o rvL x vjr 12/ V r V^rUU I ILivC'vJ 

AVGKKMYFIQFIGVAGVITKSSKEMKLTDGS 
YFGEICLLTKGRRTASVRADTYCRLYSLSVD 
NFNEVLEEYPMMRRAFETVAIDRLDRJGKKN 
SILLQKFQKBLNTGVFNNQENEILKQIVKH 


93 


1443 


A 


1249 


180 


901 


TVPPPPGGPSPAPLHPKRSPTSTGEAELKEERL 
PGRKASCSTAGSGSRGLPP\SSPMVSSAHNPN 
KAEIPERRKDSTSTPhWLPPSMMTRRNTYVCT 
ERPGAERPSLLPNGICENSSGTPRVPPASPSSHS 
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SEQ ID 
NO: of ' 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 1 
ID NO: 1 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide | 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A-Alanine C-Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine a 
l^Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
[ s ^=Oiutamine } iv— - /\rgiuine, o acinic, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














LArrovjbKoKLAKuo 1 irv.o l rnuvjy v m^imvtvu 

GGGGGGVQNGPPASPTLAHEAAPLPAGRPRP 
TTNLFTKLTSKLTRRVADEPERJGGPEVTRRP 
RQEDHLSPGGRGCSEL 


94 


1444 


A 


1261 


3 


385 


"KFSQWGLTKPKLSNASP/WISLVKKLMKKWS 

VTQNLTFREQLEAGIRYFDLRVSSKPGDADQ 

r>i\rr*TTir^T v:nT[^\r\\JT\ni ivyfFTr*<?FT TOT-TPOFTTFI 
ElYr IHOLrOllSA W JJOLlVUllL^or.u i \rrir\£i^Lir Kj 

DFNHFYAMDETHHKCLVLRIQEAFGNKLCPA 
CR 


95 


1445 


A 


1282 


2 | 


550 


GPRDNPGVEDPRFEIVEHFGIAWFTFELVARFA 

V APDFLKFFKN ALNLIDLMS1 VPFYITLVVNL 

VVESTPTLANLGRVAQVLRLMRIFRILKLARH 

STGLRSLGATLKYSYKEVGLLLLYLSVGISIFS 

VVAYTIEKEENXEGLATIPACWWWATVSMTT 

VGYGDWPGTTAGKLTASACILA 


96 


1446 


A 


1294 


1 "1 


1456 


QLLPPSNRENAGLLVGRCLCSAALRPVGDLIT 

SSGQVAVRNAPQAGSAKAGKGKFQDNFEFIQ 

YFKKFFDANCNEKDYNPVAAGQGQETEVAP 

SIV APVLNKPNQCPEGYICVKAGRNPNYGYT 

SFDTFSWAFLSLFRLMTQDYWENLYQLTLRA 

AETTYMIF/LV/LV1LLGSLYLVTLILAV/V AMA 

YEEQNQATLEEAEQKEAEFQQMLEQLKKQQ 

E AAQQ AAT AT AS EHbKEr a AA<jKJLoL» i> o o c/^o 

KLSSKSAKERRNRRKKRKQKEQSGGEEKDED 

EFQKSESEDSIRRKGFRFSIEGNRLTYEKRYSS 

PHQSLJLSIRGSLFSPRRNSRTSLFSFRGRAKDV 

GSENDFADDEHSTFEDNESRRDSLFVPRRHGE 

RRNSNLSQTSRSSRMLAVFPANGKMHSTVDC 

NGVVSLVGGPSVPTSPVGQLLPEVIIDKPATD 

DNGTTTETEMRJCRRSSSFHVSMDFLEDPSQR 

QRAMS3ASILTNTVE 


97 


1447 


A 


1295 


2 


2057 


1QTQLPTKSSQQLRKGGNCVRCKMQMNFIAE 

EVLLKYR1TFYNNNKGPNMLYIEIKAFVHFM1 

NRYLSYGSGPKRFPLVDVLQYALEFASSICPV 

CTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALS 

SELPSTSPSSVAAISSRSVIHKPFTQSRIPPDLP 

MHP APRHITEEELS VLES CLHRWRTEIENDTR 

DLQESISRIHRTIELMYSDKSMIQVPYRLHAV 

LVHEGQANAGHYWAYIFDHRESRWMKYND1 

AVTKSSWEELVRDSFGGYRNASAYCLMYrN 

DICAQFLIQEXDLIKTGQPLVGIETLPPDLRDFV 

EEDNQRFEKELEEWDAQLAQKALQEKLLAS 

QKLRESETSVTTAQAAGDPKYLEQPSRSDFSK 

HLKEETIQ1ITKASHEHEDKSPETVLQSAIKLE 

YARLVKLAQED IrTr, 1 JJ i K-Lrm v vvi riv^V 

APKKIIEKTLLEQFGDRMLSFDERCHNIMKVA 

QAKLEMIKPEEVNLEEYEEWHQDYRKFRETT 

MYLI1GLENFQRESYIDSLLFLICAYQNNKELL 

ESGEDREVNNGLI1MNEFIVPFLPLLLVDEMEE 
KDILAVEDMRNRWCSYLGQEMEPHLQEKXT 
DFLPKJLLDCSMEIKSFHEPPKLPSYSTHELCER 
FAR1MLSLSRTPADGR 


98 


1448 


A 


1304 


118 


453 


SGPSSRAIYLHRKEYSQNLTSEPTLLQHRVEH 
LMTCKQGSQRVQGPEDALQKLFEMDAHGRV 
WSQDLILQVRDGWLQLLDIETKEELDSYRLD 
S1QAMNVALNTCSYNSILS 


99 


1449 


A 


1306 


3 


| 1660 


CGYFCHTTCAPQAPPCPVPPDLLRTALGVHPE 
TGTGTAYEGFLSVPRPSGVRRGWQRVFAALS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
nod 


SEQ 
ID NO : 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D^Aspartic Acid. E=Glutamic Acid, 
F=Phenylalanine, G=GIycine, H=Histidine, 
1=1 so leucine, K-Lysine, L=Leucine, 
M=Methionine, N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V= Valine, W=Tryptophan, 
Y-Tyrosine, XHJnknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














DSRLLLFDAPDLRLSPPSGALLQVLDLRDPQF 

SATPVLASDVIHAQSRDLPRIFRVTTSQLAVPP 

TTCTVLLLAESEGERERWLQVLGELQRLLLD 

ARPRPRPVYTLKEAYDNGLPLLPHTLCAAILD 

QDRLALGTEEGLFVTHLRSNDIFQVGECRRVQ 

QLTLSPSAGLLV\a.CGRGPSVRLFAJLAELENI 

EVXEVPKIPESRGCQVLAAGSILQARTPVLCVA 

VI<UlQVLCYQLGPGPGPWQRIuRELQAPATVQ 

SLGLLGDRLCVGAAGGFALYPLLNEAAPLAL 

GAGLVPEELPPSRGGLGEALGAVELSLSEFLL 

LFTTAGIYVDGAGRKSRGHELLWPAAPMGW 

GYAAPYLTVFSENSIDVFDVRRAEWVQTVPL 

KX\ VRPLNPEG SLFL YGTEK VRLT YLRN QL AE 

KDEFDIPDLTDNSRRQLFRTKSKRRFFFRVSE 

EQQKQQRREMLKDPFVRSKLISPPTNFNHLV 

H V (jr AN uKJ^OARDKbr 


100 


1450 


A 


1318 


918 


190 


SLCVPGPVDTGTFAVMSVMVGSVTESLAPQA 

LNDSMINETARD.AARVQVASTLSVLVGLFQV 

GLGLIHFGFVVTYLSEPL\niG > TTAAA VQVF 

VSQLKYVFGLHLSSHSGPLSLIYTVLEVCWKJL 

PQSKVGTVVTAAVAGVVLVVVKLLNDKLQQ 

QLPMPIPGELLTLIGATGISYGMGLKHRFEAGV 

PPVAPNTQLFSKLVGSAFTIAVVGFAIA1SLGK 

IFALRHGYRVDSNQVWVMRDV 


101 


1451 


A 


1353 


220 


445 


DWPDLFTYPLIGSPKCFQSARPENRMYRRl VR 
SSHGNHALQEVLPRSGHGTEFTKQKFILEAAD 
HGHPPARMSIFSR 


102 


1452 


A 


1363 


542 


2 


AHLLMLNLAL\TDLL\YLTSLPFLIHYYASGEN 

WIFGDFMCKFIRFSFHFNLYSSILFLTCFSIFRY 

C VI LHPMSCF SIHKTRC A V V ACA V VWU SL V A 

VIPMTFLITSTNRTMRSACLDLTSSDELNTIKW 

YNLILTAVLLCLPLVP/TLCYTTIIHTLTHGHAN 

\DSCLKQKARRLTILLL 


103 


1453 


A 


1371 


2 


410 


CHSTESSSDFILPGDYLLGGLCPLHSGCLQVXC 
SFNEHGYHLFQAMIU^AVEEINNS 1 ALLPNIi L 
GYQLYDVCSDSANVYATLRVLSLPGQHH1EL 
QGDLLHYSPTVLAVIGPDSTNRAATTAALLSP 
FLVPMLLEQ 


104 


1454 


A 


1376 


3 


432 


NSRVEDRS/NMSLWTQNITVCPVRNVTRDGG 

RPRCGGLDCLGPAIHIANCSRNGAWTPWSSW 
ALCSTSCGIGFQVRQRSCSNPAPRHGGRICVG 
KSREERFCNENTPCPVPIF 


105 


1455 


A 


1379 


2 


396 


GLGLLYLIFAAVEGVMRVIGGSNHLAVVLDD 
11LA VlUSIr V WrlrJiJLAQI MKl IwKJLKJvN I VKi* 
SLYRHFKNTLIFAVLASIVFMGWTTKTFRIAK 

RPSA 


106 


1456 


A 


1383 


1 


432 


EDGHGG WSSRCLNO)HAEEGHREPWKRI .CI W 
QRGGHEIRFAFYFPGHPLLSPQICLAPETPPRG 

FEIQPQGSPPFLWCKMTSGTFWTCRTDSRVF 
QNANPSNAAHSEDQPTP 


107 


1457 


A 


1386 


719 


558 


FFFVTRSHSVAQAECSGVFTAHRSLDLVGSSN 
YRALSLQSSWDHRHTWLTFAFL 


108 


145S 


A 


1397 


631 


2 


RVAISLLCAAIF1SFMVQSAGKRWPTGVMLM 
VVVLFAFLYSWPIQALLPTYLKTDLAYNPHT 
VANVLSFSGFGAAVGCCV/GGFLGDWLGTRK 
AYVCSLLASQLLIIPVFA1GGANVWVLGLLLF 



144 



WO 01/57188 



PCT/U SO 1/03800 



SEQ ID i 
NO: of * 
nuel- i 
eotide $ 
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,EQ ID I 
sJO: of 1 
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lod I 
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>eginning r 
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-esidue of 
peptide 
sequence 


'redicted end / 
lucleotide 1 
ocation 1 
:orresponding 1 
o last amino I 
icid residue < 
}f peptide 
sequence ; 


Smino acid sequence (A= Alanine C-Cysteme, 
)=Aspartic Acia t ii— uiuianiK, /\ciu, 
^Phenylalanine, G=Glycine, H=Histidine, 
=Isoleucine, K^l ,ysine, L=I xucine, 
vl^Metnionine, in— /vsparaginc, r a iwimw, 
3=Glutamine, R=Arginine, S=Serine, 
n-Threonine ; V=Valine, \V=Tryptophan, 
^Tyrosine, X^Unknown, *=Stop codon, 
=possioie nucieouue uciciivjhj \ ^umw^ 
nucleotide insertion 














FQQMLGQGIAGILPKLIGGYFDTDQRAAULU 
FTYNVGALGGALAPI1GALIAQRLDLGTALAS 


109 


1459 


A 


1402 


15 


387 


VLVALPDTNVTSETVVTEVLGHRVTLPCL Y SS 
WSHNSNSMCWGKDQCPYSGCKEALIRTDGM 
RVTSPJCSAKYRLQGTIPRGDVSLTILNPSESDS 
r-\r\7nnr> TPVPnWFTsinVKTNVRLNLORASTT 


110 


1460 


A 


1421 


3 


350 


HEDLSSLLTRGSGNQERERQLKKLISLIU^WM 
LAELAFPVGVLATCA*SLLSC*YCVILFPCSCF 
FFHSPDALFSLLLLSCYFPSYCFFYYLFFSSSPL 


111 


1461 


A 


1426 


2 


344 


FTSTMTKPFEKESEQPA*ATLAFGAQTSTTAD 
QCALKPDLSYLNNSSSSSSTPATSAGGGIFGSS 
Toooxinm; a ^cwt^Pi^^t^pvsS YGFVNTAESST 
SDSLLFSQDSKLATTS 


112 


1462 


A 


1434 


46 


372 


"TTSWTTSCTRSCT*SGASSGPGWTPRTrwWK 
SRRSSQRTCSRACSGAWSRTW*RSS*TSSSSC 
STSCSSSSSRSCGRPGGPLGARGVHITSCLNSC 

MSSSTTSS1 iMr 


113 


1463 


A 


1439 


3 


292 


HEDIMTHYDRLVDF*ALNAGKQRYEKJV11^0 
M YLGEI VRNIL1DFTKKGFLLRGQISEMLKTR 
GIFLTFLLSNFLIVCVLLFYVSFYLFQSCINFVL 

T , • t fnrnuccTl — PPPv POPfYNf W Y DODLLNLO 


114 


1464 


A 


1463 


1 


396 


KQQAVPEPHbbl 1 t Jr*s,?t-^i-W 1N w i vjv j - yLjiJlNi -' v < 
QRTKVHLPGHKTGPAVAKDTPEPVKKEFTVP 
ATSQGP*SPFSEEPPLPPSNEEVPPTLPP*EPQS 
EDP*KNA*LKQMHAATTHWQQHQQHQVGC 

OYHGIMQ 


115 


1465 


A 


1464 


291 


2 


" AGSYPSMVWSCHWGVTQKRRAL* V YSKtfEG 
GRRKCGQYWPLEKDSR3RFGFLTVSNLGVEN 
MNHYKKSTLEILNPEVNPGFFFLTLWKQGEN 

NY™ 


116 


1466 


A 


1465 


667 


337 


LPPQRPA* lDblZ>101N VooUrLAUyjnnuiuy 

YWTKYQVWEWLQHFLDTNQLDANCIPFQEF 
DINGEHLCSMSLQEFTRAAGTAGQLLYSNLQ 
HLKWNGDSLFLCLSLPC 


117 


1467 


A 


1479 


1 


381 


GTSGGPKRVLVTERFPWQNPLPVNKUQA^K 
, n /-dcmct:y^d\/pi nAOk'l VSSHKPGONOKHK 
QLQATSVPHPVCMPLNNTQKSKQPLPSAPEN 
NPEEELASDPNNEESL*RPWALEDFEIGRPLG 

KGK 


118 


1468 


A 


1485 


3 


385 


" "TYLWL*GNPPFYEICNDGGLFELILKA MJ£,h im h 

m/n/nnA/fCnC A k"T-TFTT?Pf TrjRDP*KPFPCDOPL 

QHPWIEGHTCLDNNIHQAASEPINNNFAESKR 
NLAFLATGVVRHMRKLFMGANLEGPGPTVS 


119 


1469 


A 


1486 


1 


398 


- " GTTSKHH+LARSLIRGPFDHDLKPNAATRDQL 
NIIVSYPPTKQLTYEEQDLGWKFRYYLTNQE 
KALTKFLKWVNWDLPQEAKQALELLGKWK 
PMDVKDSLELLSSHYTNPTVRRYAVARLRQA 

DDEDLLMYL 


120 


1470 


A 


147 / 




999 


" MGESPAV+GYFVLAGMNSAGLSFUUUAGKY 
LAEWMVHGYPSENVWELDLKRFGALQSSRT 
FLRHRVMEVMPLMYDLKVPHWDFQTGRQL 
RTSPLYDRLDAQGARWMEKHGFERPKYFVP 
PDKDLLALEQSKTFYKPDWFDIVESEVKCCK 
' EAVCV1DMSSFTEFEITSTGDQALEVLQYLFS 
NDLDVPVGHIVHTGMLNEGGGYENDCSIARL 
NKRSFFMISPTDQQVHCWAWLKKHMPKDSN 
LLLEDVTWKYTALNLIGPRAVD\O.SELSYAP 
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SEQ ID 

vsU. 01 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

iNU. 01 

peptide 
seq- 
uence 


Met 
noo 


SEQ 
rpv xrrv 

IJL> In W. 

in 

09/496 
914 


Predicted 
beginning 
nucleotide 

IVLaUVU 

correspondi 
ng to first 
amino acid 

peptide 
sequence 


Predicted end 
nucleotide 
location 
correspond ing 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G^GJycine, H=Histidine, 
I— Isoleucine, K=Lysine, L— Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, W=Tryptophan, 
Y = Tyrosine, X— Unknown, *—Stop codon, 
/—possible nucleotide deletion, \=possib!e 
nucleotide insertion 














\yfTPni-IPPCT R/^fr'FK/IQVnV A~KinTT>\/AAQKATUT 
lvi i r utter oLrOiSJ^ivio v \j i /vinvjXK vivloJVl i ri l 

GEPGFMLYIPIEYRWGFTMLSTLVSNS 


121 


1471 


A 


1498 


3 


306 


AQFLLVGWDHIL*LIVL*TNLTELGRTTCDQN 
WPNSPDVLNHGCFYN4QCLSKDCTIGYVSRE 

ivi.i j v M_rJ i rli v c.nrl 1 u l rlL.^ i v i w i^Urlo V 1LJD 

SSDFVEFEN 


122 


1472 


A 


1533 


121 


329 


LGLFSF VWTE VLEEPKDFSCETEDFK.TLH CT 

WDPGTDTALGWSKQPSQSYTLFES*VGSGYII 

DNFFLA 


123 


1473 


A 


1547 


111 


40S 


DARTTWKPRNGSSGIWPGDGAK*PPAVEQAE 
RGHVEMIEKJLTFLNLHTSEKDKGGNTALHLA 
AKHGHSPAVQVLLAQWQDINEMNEKQQTPL 
ITVAADRG 


124 


1474 


A 


1555 


1 


745 


MTFDDDDKNTTGVALVWKKFQTQSLRLSDL 

HRKSHLWRGIVSITL1EGRDLKAMDSNGLSDP 

YVKFRJLGHQKYKSKIMPKTLNPQWREQFDF 

HLYEERGGVIDITAWDKDAGKRDDFIGRCQV 

DLSALSREQTHKLELQLEEGEGHLVLLVTLT 

ASATVSISDLSVNSLEDQKEREEILKRYSPLRI 

FHNLKDVGFLQVKVIRAEGLMAADVTGKSD 

PFCVVELNNDIU-LTHTVYKNLNPEWNKVFTL 

* V ALV WKKFQTQSLRLSDLHRKSHLWRGI VS 

ITLIEGRDLKAMDSNGLSDPYVKFRLGHQK.Y 

KSKIMPKTLNPQWREQFDFHLYEERGGVIDIT 

AWDKDAGKRDDFIGRCQWLSALSREQTHK 

LELQLEEGEGHLVLLVTLTASATVSISDLSVN 

SLEDQKEREFJLKRYSPLRIFHNLKDVGFLQV 

KV1KA£UJLMAA_UV I OKbDrrC VVhLNNDKLL 

THTVYKNLNPEWNKVFTL 


125 


1475 


A 


1556 


57 


509 


GGPAPN SRY AEP* KN S L AMT* AHADCENY V A 
CGGLDN1CS1YNLKTREGNVRVSRELPGHTGY 
LSCCRFLDDSQIVTSSGDTTCALWDIETAQQT 
TTFTGHSGDVMSLSLSPDMRTFVSGACDASS 
KXWUiivlJuMLKyor 1 UHVbUINAVo 


126 


1476 


A 


1592 


3 


178 


KSEKSCVSSLAHFGTSCQRDYDAMVKLVETL 
EMLPTCDLADQHNIKFHYAFALNR*ER 




1 H- 1 f 


A 
r\ 


l O I J. 


1 
l 


<\yl 


1 bbrl^LVKPYLPYJTK^cLHAJMTAGFSTIAGS 

VLGAYISFGVPSSHLLTASVMSAPASLAAAKL 

FWPETEKPKJTLKNAMKMESGDSGNLL*AAT 

QGASSSISLVANIAVNLIAFLAJLLSFMNSALA 

WVGNMFDYPQLSFELICSYIFMPFSFMMGVE 

wrujnvi 


128 


1478 


A 


1619 


286 


486 


CCMN SICAQES VFKN VLCNPPALSEMPD VKA 
EDEVDFRASSISEEVAVGSIAATLKMKQGPM 
TQAINR 


129 


1479 


A 


1627 


1 


395 


PTRGALRYWIFGRFLCNIWAAVDVRCCTAT1 
MUJLCllbllXKY VUVbYrLRir rJV 1 QRKuLMA 
UXVW ALSLVIYIGPLLG WRHPAPEDETICQI 
NEEPGYVLFSTPGSFYLPLAIMLVMN*RVYRV 
AKTE 


130 


1480 




1638 


2 


*-HJU 


n PR VR TV TVW R T<T TTI VPTAn W TY^QN/f A WflV d 
L/trlK Y Iv 1 ivi > IN InJS. 1 ill Hlv^UiV * vJOlYLA. V V OJvvJ 

ECHNfPCEKGDKLRLFCFRLRKRENIV'lSBCJLMS 
EMH SFIQ1 QKNTNQR SHDSRSMALPQEQSQHP 
KPSEASlTLPESHLK'lVQMPPriPSSSSFTKVT 
KDKDIK*LLFNLYSSVEILPEVLHLKT 


131 


1481 


A 


1651 


607 


3 


LAEGGDVFDCVLNG GPLPESRAKALFRQM VE 
AIRYCHGCGVAHRDLKCENALLQGFNLKLTD 
FGFAKVLPKSHRELSQTFCGSTAYAAPEVLQ 
GIPHDSKKGDVWSMGWLYVMLCASLPFDD 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenyIalanine, G=Glycine, H=Histidine, 
T=lso1eucine, K=Lysine, I.=l.eucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine 3 W=Tryptophan, 
Y^Tyrosine, X=Unkno\vn, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














TDIPKMLWQQQKGVSFPTHLSISADCQDLLK 
RLLEPDMILRPSIEE VS WHPWLAST* * KQ WQ V 
LSNKVGGESKPKKRK 


132 


1482 


A 


1656 


150 


48 


LVAKSLLYCGCLFFLLQLAKNVGNNSFNDIM 
EAKLTSPSPKPTPSSDM*VFLIY*TYFGAWHV 
VDAQ 


133 


1483 


A 


1660 


3 


406 


RKHIICLLIQICLSDVP*ECQNNQL*ICLTEICEKE 
K30EFK3CKMDDQRPEKITEA* SKDKSPMEEEK 
TEMIRSYIQEVGRYIICRLEEAQSKRLEKLREK 
HKEIRQPILDEKPKGEGSSSFLSETCHEDTSWF 
PNFTP 


134 


1484 


A 


1666 


1276 


466 


PGSTHASARITIY*L*IILSNATEVDNNFSKPPP 

FFPAGAPPASSSSSSSSSSPPTVSTAPPLIPPPGF 

PPPPGAPPPSLIPTIESGHSSGYDSRSARAFPYG 

NY AFPHLPGS APS WPSLVDTSKQ WD Y Y ARSS 

SSSSSSSSSSSSSPRDRDRER*RTRERERERDHS 

PTPSVFNSDEERYRYREYAERGYERHRASRE 

KEERHRERRHREKEETRHKSSRSNSRRRI1ESE 

EGDSHRRHKHKKSKRSKEGKEAGSEPAPEQE 

STEATPAE 


135 


1485 


A 


1673 


1 


417 


PTRPVNSSQAFALVYYTLGALGGNUAHMGL 
GYRYWAGiGVLQSCESALTHYRLVANHVAS 
D1SLTGGSVVQRIRLPDEVENPGMNSGMLQE 
DLIQYYQFLAEKGDVQAQVGLGQLHLHGGR 
G V* QNHQRAFD YFNL AA 


136 


1486 


— 


1678 


525 


9 


ANTSLSSAAVSAVSPPPCRTSTATTLPPPMPSF 

FCVFPSPSMSPSPSEFLSC1ASVSRVHSLSSSSS 

GSSSTASSLNFSAIMGSSSATASWVLSTASTPP 

CPSALPSSPAQES*SLAASSSAWPVAGISPSGA 

CTFPAGSASGAAKAPSPSWRCPSFRALFSLLD 

SSSLSL 


137 


1487 


A 


1680 


1 


2999 


AHRDEIQRKFDALRNSCTVITDLEEQLNQLTE 

DNAELNNQNFYLSKQLDEASGANDEIVQLRS 

EVDHLRREITEREMQLTSQKQTMEALKTTCT 

MLEEQVMDLEALNDELLEKERQWEAWRSVL 

GDEKSQFECRVRELQRMLDTEKQSRARADQ 

RITESRQWELA\TCEHKAEILALQQALKEQK 

LKAESLSDKXNDLEKJCHAMLEMNARSLQQK 

LETERELKQRLLEEQAKLQQQMDLQKNHIFR 

LTQGLQEALDRADLLKTERSDLEYQLENIQV 

LYSHEKVKMEGTISQQTKLIDFLQAKMDQPA 

KKKKWLQYNELKLALEKEKARCAELEEALQ 

KTRIELRSAREEAAHRKATDHPHPSTPATARQ 

QIAMSAIVRSPEHQPSAMSLLAPPSSRRKESST 

PEEFSRRLK£RMHHNIPHRFNVGLNMRATKC 

AVCLDTVHFGRQASKCLECQVMCHPKCSTC 

LPATCGLPAEYVTHFTEAFCRDKMNSPGLQT 

KEPSSSLHLEGWMKVPRNNKRGQQGWDRK 

YIVLEGSKVLIYDNEAREAGQRPVEEFELCLP 

DGDVSIHGAVGASELANTAKADVPY1LKMES 

UPUTTPWPnR TT YLLAPSFPDKORWVTALES 

VV AGGRV SREKAE ADAKLLGNSLLKLEGDD 

RLDMNCTLPFSDQVVLVGTEEGLYALNVLK 

NSLTHVPGIGAVFQIYIIKDLEKLLM1AGEERA 

LCLVDVKKVKQSLAQSHLPAQPDISPNIFEAV 

KGCHLFGAGKIENGLC1CAAMPSKVV1LRYN 

ENLSKYCIRKEIETSEPCSC1HFTNYSIL1GTNK 

FYEIDMKQYTLEEFLDKNDHSLAPAVPAASS 

i JSFPV SI VQVN SAG QREE YLLCFHEFG VF VDS 
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'CT/USO 1/03800 



SEQ ID 

via. _r 
1NU. OI 

nucl- 
eotide 
seq- 
uence 


SEQ ID 
in<j. or 
peptide 
seq- 
uence 


Met 
noo 


SEQ 

ID INIJ. 

in 

T/SSN 
09/496 
014 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng io iirst 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=GIycine, H=Histidine, 
f=Isoleucine, K=Lysine, T ^Leucine* 
M=Methionine, N^Asparaginc, P=Proline, 
Q—Glutamme, R— Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrbsine, X=UnJcnovvn, *-Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














YGRRSRTDDLKWSRLPLAFAYREPYLFVTHF 

XTQI PVICIHADCC AP.TPAD A VT TM'DXrDl>VT PDA 

ISSGAIYLASSYQDKLRVICCKGNLVKESGTE 
HHRGPSTSRR+PASPLPQYQGQRAFLQGRRK 


138 


1488 


A 


1686 


2 


526 


GRPQGPAPGAGSPPESGPGLWAALGCSLVWV 
r CL OCj AAOKL * AKo uK b CjL Kl<i<J<AJrlAGPP 
PGGPCNSCP* CSAPESGGRGPLPGPGTGGVCS 
CWTRGCQTTARTAAAAAAPGPAGRRPPGGA 
PQNGSCAASASQEaAAPPPMCPPGRRWAVAS 

DDCTDPDA ADPTDPDDI r* a 

rrt 1 KLrAAru 1 KLKKLtAA 


139 


1489 


A 


1693 


3 


376 


LPSMSNCTSCFRLQSRTES*IRQAGHLLGRNE 

FIETKALGCAWFSLCYYLVLYFESSHKVDFVF 

IV*CFSTPPGAQMTIMSQACAERCN1MRLVDR 

KWAuiAJvUVU I V^i^lLiKVHLOtC^KAJ_,vjL 


140 


1490 


A 


1704 


3 


376 


ERTNKFIKELIMDGKNL1AATKSLSVAQRKFA " 
HSLRDFKFEF1GDAVTDDERC1DASLREFSNFL 
KNLEEQREIMVS*EGCKXISQLSRGKKIWIWK 
LVLVEVVKHLSLGTVVHCNGKMRFPEP 






A 

A 


1 *7/1 3 


I 


362 


LITT^KVFVARELSCLDVHLDSTGSTAV V ADQ 
DKLELELVLKGSYEDTQTSFLGTASAFRFH Y 
N4AAL*TELSGRLRSSKSNGWNGDNSTGYLTV 
PLRPLTIVKEVTMDVPAPNVRGLNWMG 


142 


1492 


A 


1769 


1 


406 


NNPSTLPRGS*PMSPRTTMGRRRQRRREHKSS 

LSLASSTVGPGGQIVHTETTEVVLCGDPLSGF 

GLQLQGGIFATETLSSPPLVCFIEPDSPAERCG 

LLQVGDRVLSINGTATEDGTMEEANQLLRDA 

ALAKKVV 


143 


1493 


A 


1789 


1 


447 


QMLRNGGDQNTVPDYHFADRJRELL*PTEDQ 
K.NCI1 *Di Yi^RrSAJLONI VCEVTHPCSPGPCPA 
NELCEVNRKGCTSGDPCLPYFCVQGCKLGQA 
SDFIARQGTLIQVPSSAGEVECYKICSCGQSGL 
LENCMEMHCMDLPTDTSALVR 


144 


1494 


A 


1814 


1 


404 


PGRRFRPRLSQAGTDSGS* VFPDSFPSAPAEPL 
r Y r LQbPQD A 1 1 VKNKJ^VELKCRAFPATQIYF 
KCNGE\WSQNDHVTQEGLDEATGLRVRE\0-I 
IEVSRQQVEELFGLEDYWCQCVAWSSAGTTK 

aKKA Y V Kl 


145 


1495 


A 


1827 


26 


448 


XVEEl<JlADTWRSXCLSDFFFHAAKXLCXE*N 
CGDAISLSVGDHFGKGNGLTWAEKFQCEGSE 
T11LALCPIVQMPEDTCIHSREVGWCSRYTDV 
l^VNGKSQCDGQVEINVLGHWGSLCDTHWD 


146 


1496 


A 


1828 


574 


333 


QHEGGDLRRRQLGEIQLTVRYVCLRAASAC* 
SMAAET* HHVPASG ADP YVR V YLLPERK W A 
CRKKTSVKRKTLEPLFDET 


147 


1497 


A 


1S55 


1 


372 


ERLVLTSEHCLVLTLFWPSWTYHTLLLSRQH 
VRRI.PKLTFL\EHDHLASIMNKLLTNYDNLFE 

tc \ jta.' o jt/— * * i.ip a dt/"* cr a /** A VI \ l l~K1X 1 *. jt 1 ft ATI 

1 bv i i 5>M(j ¥ HO AF I ObbAu AN W NH* * LHAH 
YYPPLLRSDTVRKFMVGSQMLAQAQRDLTPE 

Q 


148 


1498 


A 


1879 


568 


7 


IAHTYMAAEYLEKAGRKJXtVNVYVEKQGAN 

GIEGRLTADQLNSATAC1FAAEVAIKESERFN 

GIPALSVPVAEPIRHAEALMQQALTLKRSDET 

RTVQQDTQPVKSVKTELKQALLSGISFAVPLI 

VAGGTQVA*AV*RQGISSLHDVQVRTWNS 


149 


1499 


A 


1880 


611 I 


24 


GLNSENALSNEAMERGWQCLRLFAERLQDIP 
PSQIRWATATLRLAVNAGDFIAKAQE1LGCP 
VQV1SGEEEARLIYQGVAHTTGGADQRLVVD 
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WO 01/57188 
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SEQ ID 5 
NO: of 1 
nucl- I 
eotide i 
seq- i 
uence 


SEQ ID I 
sTO: of I 
peptide 
;eq- 
jence 


vlet J 

i 

} 


SEQ I 
DNO: t 
n t 
USSN 1 
19/496 ( 
?14 i 


>redicted 1 
>cginning i 
uicl eotide 1 
ocation < 
;orrespondi 1 
ig to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end j 
mcl eotide 
ocation 
:orresponding 
to last amino 
acid residue 
of peptide 
sequence 


\mino acid sequence (A^Alanine C-Cysteine, 
3— Aspartic Acid, b=Ulutamic /vcm, 
^Phenylalanine, G=Glycine, H=Histidine, 
=Isoleucine, K=Lysine, L=Leucine, 
M==Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, K— Arginine, 0— acinic, 
T=Threonine, V=V aline, W=Tryptophan, 
Y=Tyrosine, X^Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














IGGASTELVTGTGAQTT*LFSLSMGCV I WLEK 
YF ADRNLGQENFD AAQKAAREVLRPV ADEL 
RYHSWKEVRGASVTVQALQEIMMAQGMDE 
RITMElwrVD 


150 


1500 


A 


1894 


2 


750 


GRVDFFHTDYRPLIRDSNNYVLDEQTQQAFH 

LMPPPFLVDVDGNPHPTKYQRLVPGRENSAD 

EHLIPQLGYVATSDGEVIEQIISLQTNDNDERS 

PESSILDGMIRQLQQQQDQRMGADQDTTPRG 

LSNGEETPRRGFRRLSLD1QSPPNIGLRRSGQV 

EGVRQMHQNAPRSQIATERDLQAWKRRVVV 

PEVPLGIFRKLEDr KJ^titvOniiiiiviN 1 . 1 uvjivivxvtv 

TLOLSHKSDSVGLVSQSRPRTCRRKYP 


151 


1501 


A 


1900 


141 


785 


GKTIQIQTTMQNKYKTVQKQYKTIPKNKJ^A 
MEMQIKKQFQDTCKVQTICQYKALKNHQLEV 
TPKNEHKTILKTLKDEQTRKL AILAEQ YEQ SI 

NEMMAS Q ALRLDb AV^ r- AX/U ^/vlim-v L W DIV1 
ELLNAYQSKIKMQTEAQHERELQKLEQRVSL 
RRAHLEQKIEEELAALQKERSERIKNLLERQE 

REIETFDMiioJLKJVlvjrvjlNlw v 1 LrLrrjvDL' 1 ^ 


152 


1502 


A 


1915 


2 


377 


LVRLLDTQRDGLQNYEALLGLTNLSGRyDKi. 
RQKIFKERALPDIENYMFENHDQLRQAATEC 
MCNMVLHKEVQERFLADGNDRLKJLVVLLCG 

rrvrvrvi/- \ft~VKl A A A ( i A 1 AN/11 TA AHTCKLCL.KA4T 
nVTT 


153 


1503 


A 


1921 


I 


237 


AYQSLRLEYLQIPPVSRAYTI'ACVLTSAAV QL 
ELITPFQLYFIPELIFKHFQIWRL1TNFLFFVPFG 
FNFLLYMIFLYT 


154 


1504 


A 


1928 


2 


354 


EMVEGGEGKMCINTEWGGFGDNCiCiUUlKl K 
YDTEVDEGSLNPGKQRYEKMTSGMYLGEIV 
r»/-\Tt mi wnn i Ft?r;nT<sFRT RTRGIFETKFLS 
OIESDRLALLQVRRILQQLGLD 


155 


1505 


A 


1929 


2 


369 


"TEIAKIKMEAKKKYEKELTMFQNDFEKACQA 
t^ott at \n t>-c-t^^xt PT?TT-nvTTf>FTFTKEIYAOR0 
LLLKDMDLLRGREAELKQRVEAFESYQLELK 
DDYIIRTYRLIEDDRINIOISGHWQESP 


156 


1506 


A 


1935 


1 


270 


VTRKLPIFI VD AFT ARAFRG SP AADCLLENEL 
DEDMHQKIAREMNLSETAFIRKLHPTDNFAQ 

RSC/ruLIWr l.rl I jJL,vjil. i ooiL-r on ■> 


157 


1507 


A 


1936 


584 


305 


" ESKVNNEKFRTKSPKPAESPQSATKQLUQP1A 
\YEYYDAGNHWCKDCNTICGTMFDFFTHMH 
NKKHTOG0FQKSSDFQBCEELO0TFLPPERQG 


158 


1508 


A 


1939 


1 


423 


" TTHRLNVTAEPPCTSMPIYWMPDVPHRCTTA 

^ 1 „„ T ^, Tf-wrs-* a rxKinw f^l WfrFLPYGSLED 
NTCP V DL 1 L/iCAl^Nur 1 *-i-< v i uriji i ^^^^ 

RLHCQTQACPPLSWPQRLD1LLGTARA1QFLH 

QDSPSLIHGDIKSSNVLLDERLTPKLGDFGLA 

RFSRFAGSSPIQSSM 


159 


1509 


A 


1974 


3 


401 


HTSTARLLLHRG AGKEAVTSDG Y'l ALHLAAK 

NGHLATVK-LL. V nilis AU V .u,rvi\Ajr l^in v^/ i r*i_La 

AAAHGHSEVVEELVSADVIDLFDEQGLSALH 
LAAQGRHAQTVETLLRHGAHINLQSLKFQGG 
IIGPAATLLR 


160 


1510 


A 


1982 


2 


417 


KFLKDLEKQYNKEEPHLSEIGSCFLQNQEGFA 
IYSEYCNNHPGACLELANLMKQGKYRHFFEA 
CRLLQQMIDIAIDGFLLTPVQKJCKYPLQLAEL 
LKYTTQEHGDYSNIKAAYEAMKNVACLINER 
KRKLESIDKIA 


161 


1511 


A 


1984 


4 


770 


RETGS VSLSPSGLEG AESY AVSPIL Y SSPD VKE 
LWLETLQGQRHSHTGVKSTPGQSAAILMKLR 
| SSHNASKTLNANNMETLIECQSEGDITXEHPLL 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=vYlanine C=Cysteinc, 
D=Aspartic Acid, E=G!utamic Acid, 
F=Phenylalanine, G=Glycine. H=Histidine, 
J=Isoleucine, K=Lysine, L=Leucine J 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine ? 
T=Threonine 3 V=Valine, W=Tryptophan, 
Y=Tyrosine, X^Unknown. *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














ASCESEDSICQLIEVKKRKKVLSWPFLMRRLS 
PASDFSGALETDLKASLFDQPLSIICGDSDTLP 
RPIQDILTILCLKGPSTEGI FRRAANEICARKEL 
KEELNSGDAVDLERLPVHLLAVYFKDFLRSIP 
RKLLSSDLFEEWMGALEMQDEEDRIEALK 


162 


1512 


A 


1986 


864 


501 


LLNSGLFSAPDGSNLEMRLTRGGNMCSGRIEI 
KFQGRWGTVCDDNFNIDHASV1CRQLECGSA 
VSFSGSSWGEGSGPiWFDDLICNGNESALWN 
CKHQGWGICHNCDHAEDAGVICSSKD 


163 


1513 


A 


2001 


419 


187 


AVDLSIDESSLTGETTPCSKVTAPQPAATNGD 
LASRSN1AFMGTLVRCGKAKGVVJGTGENSE 
FGDIINLSTFVVHS 


164 


1514 


A 


2012 


284 


597 


SLLCLFPGTSTVVCKPIVIETQLYVIVAQLFGG 
SHIYKRDSFANKFIKIQAIEILKIRKPNDIETFKI 
ENNWYFWADSSKAGFTTIYKWERETGFYSH 
QSFTR 


165 


1515 


A 


2013 


2 


403 


EDPEELGHFYDYPMALFSTFELFLTIIDGPANY 
N VDLPFM YSIT Y AAFA 11 ATLLMLNLL I AMMG 
DTHWRVAHERDELWRAQIVATTVMLERKLP 
RCLWPRSGICGREYGLGDRWILRVEDRQDLN 
RQRIQRYA 


166 


1516 


A 


2019 


2 


927 


CCQREGLGLKAWQILLSHGRNGLPGEPASS 

OGLSAASSTPVFHLALOIDSAPDNIDWVFM1 V 

NKNMVTERLQNVMYLEQCFSDSSSLYRFLTY 

S YLLAFNVWLLL AP VTLCYD W Q VGSIPL VET1 

WDMRNLATIFLAVVMALLSLHCLAAFKRLE 

HKEVLVGLLFLYFPFIPASNLFFRVGFVVAER 

VLYMPSMGYCILFVHGLSKJLCTWLNRCGATT 

LIVSTVLLLLLFSWKTVKQNEIWLSRESLFRS 

GVQTLPnN.AJ<.VHWYANFLKDQGRNI<£AIY ! 

HYRTALNNNKAWDYLCWRFRKTLTDLP 


167 


1517 


A 


2025 


696 


71 


AAASAASSLTVTLGRJ.ASACSHSILRPSGPGA 

ASLWSASRRFNSQSTSYLPGYVPKTSLSSPPW 

PEVVLPDPVEETI^HAEVVKKVNEMIVTGQY 

GRLFAWHFASRQWKVTSEDLILIGNELDLA 

CGERIRLEKVLLVGADNFTLLGKPLLGKDLV 

R VEATVIEKTE S WPRIIMRFRKJUCNFKKKRI V 

TTPQTVLRINS IEI APCLL 


168 


1518 


A 


2046 


2 


366 


HLQVAARVFMPLQAVDSAPKPLKGQAQAPQ 
RLQGAARVFMPLQAQVKAKASKPLQMQIKA 
PPRLRRAARVLMPLQAQVRAPRLLQVQSQVS 
KKQQAQTQTSEPQDLDQVPEEFQGQDQVLR 


169 


1519 


A 


2049 


1 


945 


QNLEDREVLNGVQTELLTSPRTKDTLSDMTR 

TVEISGEGGPLG1HVVPFFSSLSGRILGLFIRGI 

EDNSRSKjREGLFHENECIVKINNVDLVDKTFA 

QAQDVFRQAMKSPSVLLHVLPPQNREQYEKS 

VIGSLNIFGNNDGVLKTKYPPPVHGKSGLKTA 

NLTGTDSPETDASASLQQNKSPRVPRLGGKPS 

SPSLSPLMGFGSNKNAKKIK1DLKKGPEGLGF 

TVVTRJDSSIHGPGPIFVKNILPKGAAIKDGRLQ 

SGDR1LEVNGRDVTGRTQEELVAMLRSTKQG 

ETASLVIARQEGHFLPRELVMFRSQSH 


170 


1520 


A 


2050 


363 


1 


PVATHLTKILNSDEHAVVISSAICTLCETV1CDF 
VAKVEKTYDKTLENAVVADAVASKCSVLNE 
KXEQLLQALHTDSQAAPVLPGLSPLIVEEDAV 
ESSSEESLGESKEQLGDDVTKPSSQKA 


171 


1521 


A 


2055 


139 


675 


IPSRPWLGRITGLDPAGPLFNGKPHQDRLDPS 
DAQFVDYIHSDTDALGYKEPLGNIDFYPNGG 
LDQPGCPKT1I.GGFQYFKCDHQRSVYLYLSSL 
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seq- 
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SEQ ID 
NO: of 
peptide 
seq- 
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Met 
hod 
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ID NO: 
in 
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09/496 
914 


Predicted 
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nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine s 
r_)=A.spartic A.cia, ii — vjiuiamic /i^iu, 
F=Phenylalanine, G=Glycine, H=Histidine, 
l=!soleucine, K=Lysine ; L=Leucine, 

X A \ A a.tY\ inninp T\I= A croMrrinP P— Proline 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, \V=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 

POSbJ UlC IJULiCUUUC Uvlwllvll) i }j*jjoiisi\* 

nucleotide insertion 














RESCTITAYPCDSYQDYRNGKCVSCGTSQKE 
SCPLLGYYADNWKDHLRGKDPPMTKAFFDT 
AEESPFCiVmnfFVDUTWNKNVR 


172 


1522 


A 


2056 


3 


361 


t t Auif c a vp v a n^HT <nT v^mpkfshkcsepk 
ME\VTCVKIRSDGTRYITKRPVRX)RILKERALKI 
KEERSGLTTDDDTMSEM1CMGRYWSKEERKQ 
HLA^RGKEQRRRREFMMRIRLKCLKES 


173 


1523 


A 


2060 


1 


387 


A T nu 0K/frVTOT?vr5PnPTT}T9FHA/[ A A AGVFAT I. 
O 1 l^EolVlv-Lr r Vvjryrm 1 oEiFUVLr-vrw-vvj \ i 

QAYAFLQYLRDRLTKQEFQTLFFLGVSLAAG 
AVFLSVIYLTYTGY1APWSGRFYSLWDTGYA 
KlHlPllASVSEHQPrrWVSFFFDLHlLGCTFPA 


174 


1524 


A 


2071 


74 


443 


LLMGPl<AlCKSGSiaaaCVTKAERLIsXLQEEEE 
RRLKJEEEEARLKYEKEEMERJLEIQRIEKEKW 
HRLEAKDLERRNEELEELYLLERCFPEAEKLK 
QETKXLSQWKHYIQCDGSPDPSVAQEMNT 


175 


1525 


A 


2083 


139 


486 


AALT W SQPQEF WPMEMQPIVTDMVT VH W V 
AESSTVGWLCALFRVTIWGVGATGHGWCG 

RJ^VECCjJ_<"L/r br AxJVlJrlMoJLrl^Vjnolvrviiiv^ v v 

RLQFPYLEPGfiELPATTLLAFLAAV 


176 


1526 


A 


2092 


3 


587 


EGSVNFICFGVLFAiCDGQLlDDEMFSNElGSEP 
FQKFLNLLGDTITLKGWTGYRGGLDTKNDTT 
GIHSVYTVYQGHEIMFHVSTMLPYSKENKQQ 

-v rr-T) T/^oTLrrr^xir^n/TTX/POPriPF ^ AFK P^MTR S 
V El^vKnivjiNLH V liVrviC/ULiyoorru j\.rojviu\o 

HFTH1FALVRYNQQNDNYRLKIFSEESVPLFG 

PPLPTPPVFTDHQEFRDFLLVKLINGEKATLET 

PCI 


111 


1527 


A 


2103 


44 


427 


GKGQVSLEGRPHRGPLCLGSWWPGSRVPGC 

r^r^r* \ \\n a \*v A rwVPr.MHPP^P A ^ A A AT T G 

CSVSTACLCVPLCSGSPLAPFRRTAAJLQEGLR 
RAVSVPLTLAETYASLWPALQELARCGNLAC 
RSDEQ 


178 


1528 


A 


2104 


2 


409 


ALQo 1 LuAVWLULLLI\oLVviVV/\r-oivuy v ry 

PSTAASSEGAWEIFCNHSVSNAYNFFWYLHF 

PGCAPRLLVKGSKPSQQGRYNMTYERFSSSL 

LILQVREADAAVYYCAVEVPNTDKLIFGTGT 

r> r f^\/T7E>T\TTrYNJPri 

avLv V r r JNlv^* 


179 


1529 


A 


2111 


1 


312 


PTRSSTRPPSLFVHASAKGGEKEEGDDGHYL 
MRTESHTGLKXGGNANLWMLKJ^NTEPKKG 
SYHFDLERLRAAHILFEREQEHLAPGGISMPL 
PPPLPLPACLG 


180 


1530 


A 


2116 


3 


366 


" tcti'd a n:TTn\/TP Q FrrMD <? 9 F A WOOHD VOF 
i bilvKAiti 1 I My I K a r Li \v u jo c/i w Vdynu ^ V L 

LCRX^MFDALEQKWKQ'FEQADLINELYQGKL 

KDYVRSLECGYEGWRIDTYLDIPLVIRPYGSS 

Q AF AS VV CTFHLTAC VSLHR1HNST VV 


181 


1531 


A 


2117 


2 


386. 


YGLGAHFGRLFIQAGINENDFYDGAWCAGR 
NDLQQWIEVDARRLTRFTGVTTQGRNSLWLS 

JJW Viol ivv 1V1 V oiN VJ on 1 W v v vjivi> «Jovji--'iviii x_ 

GNSEKEIPVLNELPVPMVARYIRINPQSWFDN 
GSICI 


182 


1 COO 

1532 


A 

A 




i 


493 


RTKTDVYILNLAVADLLLLFTLPFWAVNAVH 
GWVLGKIMCKITSALYTLNFVSGMQFLACISI 
DRYVAVTKVPSQSGVGKPCWIICFCVWMAAI 
LLSIPQLVFYTVNDNARCIPIFPRYLGTSMKAL 
IQMLEICIGFVVPFLIMGVCYFITARTLMKMP 

Niias 


183 


1533 


A 


2140 


3 


561 


rqawheafkvrkjeiltviccllafciglifvq 
rsgnyfvtmfddysatlplliwileniavcf 
vygidk^edlkdmlgfapsryyyymwkyi 
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SEQ ID 

"M/"V ~C 

NU: ot 
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eotide 
seq- 
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SEQ ID 

jNU. 01 

peptide 

seq- 

uence 


Met 
nod 


SEQ 

ID iN(J. 

in 

USSN 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to jast anjino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, Jb— (jJutamic Acid, 
F=Phenylalanine, G=Glycine, H^Histidine, 
I =Iso leucine, K=Lysine, L=Leucine, 
ivi— ivteinionine, in— /isparagine, r^iroiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine } V=V aline, W-Tryptophan, 
Y=Tyrosinc 3 X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














EEFLSYPTWGLAVCASLDVFAILPVPVAFIGR 
RFSLIDDGAGPFCSAAYTTTGCRTPYL 


184 


1534 


A 


2145 


3 


538 


HELTVAAADRGQPPQSSWPVTVTVLD\ / ND 

NPPVFTRASYRVTVPEDTPVGAELLHVEASD 

ADPGPHGLVRFTVSSGDPSGLFELDESSGTLR 

LAHALDCETQAJRHQLVVQAADPAG.AHFALA 

PVTIEVQDVNDHGPAFPLNLLSTSVAENQPPG 

TT \/THTT TJAinnriAnAPnDT T? VU1 


185 


)535 


A 


2151 


2 


671 


LDKLLDRMENYNIFNEYILKQVAATY1KLGW 
PKNNFNGSLVQASYQHEELRREVIMLACSFG 
NKHCHQQ ASTLISD WIS SNRNRIPLN VRDI VY 
CTGVSLLDEDVWEFIWVUCFHSTTAVSEKKIL 

DVIIHVARNPHGRDLA\\^KFFRDKWKILNTRI 

RQKTLEFDFAEPLILAFPIILYTAIDNPP1 ,VREH 
p 


1S6 


1536 


A 


2153 


2 


400 


GPMCDKHSAFAEKFHAGF1DYIVHPLWETWA 
HLALPDAQDILYTLEDNRNWVDSMIPQSPSPP 
LDEQNRDWQGLLENLHVELTLDEEDSEGPEK 
EGEGQTYFTSSKTLCGIVPQNTDSLGETGIHIC 
AHDKSP 


1 o / 




A 


Zl 


zz / 


44 Z 


r Ntr K V A bUbr IwfciN 5> SlwLlMlLPLRN A i QJbr 11K 

PGAVAYTCNPSTLGGWGGWITRSGVRDQPG 

QHGGTPS 


18S 


153S 


A 


2167 


3 


486 


AJhlLGGAWLTQRSLGSWAAPGPARAAKEVVA 
CIPQNQKjVfNIWRVfKTSKHLQLLSFVLGAVSP 
A V VVr i IvlM VJLvjnINO Y Vj VtJcAjlr 1 LL»MLAAoo 

MDDILAITGFNTCLSIVFSSGCARSSGSRNSKS 

LRTPLGTICEGCDDSSIFSHLDHSSKWSSTYG 

HSGA 


1 RQ 
l07 


1 

1 JJ7 


A 
/V 


Z I Oo 


z 


4 I Z 


brl^iMNQl 1 v^/L^riN 1 Irlsi^lVlrNJ^RoVJjLbYNKJL 

QALAPDLFHGLRJCLTTLHMRANAIQFVPVRIF 

QDCRSLKFLDIGYNQLKSLARNSFAGLFKLTE 

LHLEHNDLVKVNFAHFPRLISLHSLCLRRNKV 

AIVVSSLDW 


190 


1540 


A 


2179 


64 


399 


MRLNQNTLLLESFGXXRPYTSEHAPTYHQW 
IvlIvAJJbLLK W I 1 aJbr L I L.bJri n Y AMQK 1 WLhJJ 
AYECTFIVLDAEKRHAQPGATEESCMVGDVN 

1 FT TT>T FHT TT nPTFVT IAPP 


191 


1541 


A 


2190 


1 j 


469 


CLDRAAGIRIIERN VIYINETHTRI IRG WLARR 
LSWLFIQERDVHKGMFATNVTENVLNSSRV 
QEAIAEVAAELNPDGSAQQQSKAVNKVKXK 
AJCRILQEMVATVSPAMIRLTGWVLLKLFNSF 
FWlsJIOIT-ncnOT FMVKAATFTMT PT T FT PVT-IP 

SH 


192 


1542 


A 


2197 


26 


157 


PSIOCGGIIvLLLTGTQLYGRFGSAIAPLGDLDR 
DGYNGEGREEPY 




l 

l j*i j 


A 

A. 


ZZJO 


Z 




YTHSKGIMHIvDVKPLNILCNSPRNKVILADW 
GLAJiFYIIPMllKYSVliVATRYYKSPEILLDYE 
YYDYSLDIWAVGVILLELLTLKLHVFEGGDN 
EQ 


194 


1544 


A 


2241 


105 


409 


RKGVGKMPTSEGRPGQERSDWV'rSYKVMGS 
NDSHTWVTVKNGSGDMIFEGNSEKEIPVLNE 
LPVPMGAJIYIRINPQSWFDNGSICMRMEILGC 
PLPDPNNY 


195 


1545 


A 


2245 


1 


672 


MGVASDWTKRIEYQPGSGSMPLFPSIHLETCD 
GAVSSLQ1VTELQTNYIGKGCDRETYSEKSLQ 



152 



SDOCID: <WO 0157188A2 I > 

L 



WO 01/57188 



PCT/US01/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
MO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 1 
beginning i 
nucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteinc, 
L)=Aspartic Acid, E=Glutamic Acid, 
F=phenylalanine, G=Glycine, H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine ? N=Asparagine, P=Pro!ine ; 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V=V aline, W=Tryptophan, 
Y— lyrosine. A— unKnovvn. — oiup wuuuii, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














KLCGASSGIIDLLPSPSAATOWTAGLLVDSSE 
MIFKFDGRQGAKIPDGIVPKNLTDQFTITMW 

ALYVHNCRLVFLLRKDFDQADTFRPAEFHW 

KLDQQALAKVDGQPGKSITRQLQEMPVTIQG 

1SLKPS 


196 


1546 


A 


2256 


I 


396 


FRGTPVSGLTNRDTLAVIRHFREP1RLK T VKP 
GKVINKDLRHYLSLQFQKGSIDHKLQQVIRD 

tvtt DTTDPTTP A PD T^HF WOVDYNFTS VEOF 

KALEESGALLESGTYDGNFYGTPKPPAEPSPF 
QPDPV 


197 


1547 


A 


2259 


43 


594 


QLAIEIGVRALLFGVFVFTEFLDPFQRVlQPEiil 
WLYKNPLGQSDNIPTRLMFAISFLTPLAVICV 

•» rry TT T» T> TUFTC 1 1/ TT A PI AUQI Al AT NGVCTNTI 

V KIIRlxTDK 1 t/livt Ax LAV oL,rYJ^r\J-^NVJ v a in iA 

KLIVGRPRPDFFYRCFPDGVMNSEMHCTGDP 
DLVSEGRKSFPSIHSSFAFSGLGFTTFYLAGKL 
HCFTESGRGKSWRLCAAILPL 

-■ s-,-rrr-xr\ n rmn aVtt' xrrxni CFgtfTTQT PP^ATOIVfFF 


198 


1548 


A 


2275 


3 


404 


TCTT V W IPRMLL VJL>rL/ot-owi loLrc^Ai v^jivirr 

FLGFASNNCFIMAAMSYDRYTAn-INPLQYHT 
LMTRKICLQMMMASWMVGFLFSLCIIVTVFN 
LSLCDLNTrQHYFCDISPWSLACNYTFYHEM 
AlFVLSA 

— t rr-rn Vrrrri a 1 C A TCCTTI T A IV/f A P1~M} VV ATCT-l Pi 


199 


1549 


A 


2315 


1 


375 


LTQMFFIHALS Albb> l ibbAMAr ut\ i v /\.iv_^rxr i_> 
RHAAVLNNTVTAQIGIVAVVRGSLFFFPLPLLI 
KRLAFCHSNVLSHSYCVHQDVMKLAYADTL 
PNVVYGLTAILLVMGXDRMFISLSYFLI1 


200 


1550 


A 


2334 


2 


409 


PRVRPQQRFCMSFFFKTELGEKLVTKFLFETDF 
SDDPMLPSPDQLKKKAPFTNKKLKAHQTPVD 
ILKQKAHQLAbMQ V ^AYNOors/UNrru. /-vlninc 
EEEDEEDEYDYDYESLSDDNILEDRPENKSCH 
DQLQFEYKEEM 


201 


1551 


A 


2350 


3 


512 


ISWEAQIAEI1QWVSDEKDARGYLQALASKM 
TEELEALRSSSLGSRTLDPLWKVRRSQKLDM 
SARLELQSALEAEIRAKQLVQEELRKVKDAN 
LTLESKLKDSEAKNRELLEEMEILKKKMEEK 
FRADTGKLMLCD S ALFEYK YFSNECFYFLFD 
LIVTLEAPTEFQIQY 


202 


1552 


A 


2351 


1 


1003 


PSSYSSDELSPGEPLTSPPWAPLGAPERPEHLL 
NRVLERLAGGATRDSAASDILLDDIVLTHSLF 

T ^pt/pt rvcT unvr\/TJ AnnK/fFfTPFfrT CrRTCOA . 
LPTEKr LQfcLrH^ Yrv ivA.ouivij^ox i^vji^^x^-i^v^ 

CLAMLLHFLDTYQGLLQEEEGAGHUKDLYL 

LIMKDESLYQGLREDTLRLHQLVETVELKIPE 

ENQPPSKQVKPLFRHFRRJDSCLQTRVAFRGS 

DEIFCRVYMPDHSYVTIRSRLSASVQD1LGSV 

tcvt nvoECPAr.RPn^i IT VAV9SSGEKVLLO 

PTEDCVFTALGINSHLFACTRDSYEALVPLPE 

EIQVSPGDTEIHRVEPEDVANHLTAFHWELFR 

CVHELbr VDY vrxiuii 


203 


1553 


A 


2361 


2 


403 


NNLNCAEPLFEQNNSLNVNFNTQKKTVWLIH 

GYRPVGSIPLWLQNFVRILLNEEDMNVIVVD 

WSRGATTFIYNRAVKNTRKVAVSLSVHIKNL 

LKHGASLDNFHFIGGSLGAHISGFVGKIFHGQ 

LGRITGLDP 


204 


1554 


A 


2390 


280 


476 


" SPSLLPQCLMSLSDLSLSPAPPSHLSPRCPSPQ 
AGSRLGAMRRCAREMDATPMPPAPSCPSERV 

T 


205 


1555 


A 


2400 


543 


745 


AAVALRDISWQQPYPMDFYAGSSLGPWTVN 

HGQDRRPHAPGRPARGKVQEGSARPPSAVAC 

EDCSCR 
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seq- 
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Met 
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SEQ 
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Predicted 

ueginning 

nucleotide 

location 

correspond! 

no to firct 

amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 

i r» i H r p» c i H 1 1 p» 
dLIU ICMUUC 

of peptide 


Amino acid sequence (A=AIanine C=Cysteine, 
D— Aspartic Acid, E = Glutamic Acid, 
F=Phenylaianine, G=Glycine, H=Histidine. 
I=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 

t~~} — 1 1 i-f^t-vi ina 1J — A rcr ininP 1 C— ri nf* 

— o i u umiiiit, In. — /Arginiiic, o — "ocrinc, 

T=Threonine, V=Valine, W=Tryptophan, 

Y="Tvrr>^inp \ r — 1 1nL'nnwn ^— Qfr>o roHnn 

/=possible nucleotide deletion, \=possible 
nucleotide insertion 


206 


1556 


A 


2406 


122 


485 


DLSPDSREDHPQGHRRIXPKRPVRGSLMPGH 
THHPCPVSSTTNDTPDQIWVSVGSLRMGTGG 
VfGANAST^PUPWDT ^SGNKK WTTOVPII A<sTV 
ESRG GLL ATG VGGMC AC VPRNQPLTGT 


207 


1557 


A 


2409 


289 


418 


LWTLYRHKQQVQHNHSNRLSCRPSQEDRAT 
HTIMVLDKENTLS 


20S 


1558 


A 


2413 


64 


492 


VQGTGXXFIAFTEAMTHFPASPVWAGMFFL 

MLINLGLGSMIGTMAGITTPI1DTFKVPKEN4FT 

GGCCWAFLVGLLFVQRSGNYFVTMFDDYSA 

TLPLTLIVILENIAVAWIYGTKKFMQELTEML 

GFRPYRFYFYMWKFVSP 


209 


1559 


A 


2417 


3 


877 


EKERLLDEWFTLDEVPKGKLHLRLEWLTLMP 

XT A CXTT PlVWT TT^T V A Flfc" T\l~\ A \inp I COAT t IT V 

LDSARNLPIRYKTNEPVWEENFTFFIHNPKRQ 
DLEVEVRDEQHQCPLGNLKVPLSQLLTSEDM 
TVSQRFQLGNSGPNSTIKMK1ALRVLHLEKRE 
RPPDHQFISAQVKRPSVSKEGRKTSIKSHMSG 
i>rurOuoJN 1 Al o 1 r V lOUbJJJ^OMtbKAQr rh, 

AGPQGLHDLGRSSSSLLASPGHISVKEPTPSIA 
SDISLPIATQELRQRLRQLENGTTLGQSPLGQ1 
QLTIP 


210 


1560 


A 


2422 


35 


456 


REFAASDLEPFTPTDQPISPEAITQPSCIKRQRA 
AGNPGSLAAT1DHKPCSAPLEPKIQASRNQRW 
GAVRAAESLTDIAEPASPQVHETPIDASQTQK 
VEPASBCSRFTPELQAKVSHSRERALSTMDATP 
HHAQPQRGEG 


0 1 1 




A 

A 




1 


/64 


KKl bt^Kl^lQrll AOQLLKI i rAA I RIDo^NPNr 
LMFWLHGIQLVALNYQTDDLPLHLNAAMFE 
ANGGCGYVLtCPPVLWDKNCPMYQKFSPLER 
DLDSMDPAVYSLTIVSGQNVCPSNSMGSPCIE 
VD VLGMPLDSCHFRTKPII IRNTLNPM WNEQF 

T PT-TVT-TPFr^T VPI I? PA VVCWMQQ A VT A TT~PT 

KALKRGYRHLQLRNLHNEVLEISSLFINSRRM 
EENSSGNTMSASSMFNTEERKCLQTHRVTVH 
GVPG 


212 


1562 


A 


2436 


1 


411 


GIRGTTGHLGCPINDDPSLTLTVSWVMEDKPI 
YIGNGTKJCEDDSLTIFA VAKRDH VS DTCGAC 
TDLDHNLDKGYLTVLGEQATPTNRLGALPKG 
RANRTRDLELTYLAERIVRLTWIPGDANNRPI 
TDYDCQIEEHQ 


213 


1563 


A 


2445 


1 


1294 


MSSIGCLWVSRSSQIDGLTAEKSGPEKPHGT 

WLMPELFIPKEQILELLVLEQFLSILPEELQIWV 

QQHNPESGEESVTLLEDLEREFDDPGQQVPAS 

PQGPAVPWKDLTCLRASQESTDIHLQPLKTQ 

LKSWKPCLSPKSDCENSETATKEGISEEKSQG 

SQGGSFSQVIFTNKSLGKRDLYDEAERCLILT 

TDSIMCQKVPPEERPYRCDVCGHSFKQHSSLT 

QHQRIHTGEKPYKCNQCGKAFSLRSYLIIHQR 

IHSGEKAYECSECGKAFNQSSALIRHRKIHTG 

EKA.CKCNECGKAFSOSSYL1IHORIHTGEKPY 

ECNECGKTFSQSSKLIRHQRIHTGERPYECNE 

CGKAFRQSSELITHQRIH SGEKP YECSECGKA 

FSLSSNLIRHQRIHSG 


214 


1564 


A 


2461 


1 


615 


G1PGSTISSSRNIFLEDDLAWQSIJHPDSSNTPL 
STRLVSVQEDAGKSPARNRSASITNLSLDRSG 
SPM VPS YETS VSPQANRTYVRTETTEDERKIL 
LDSVQLKDLWKKICHHSSGMEFQDHRYWLR 
TI-IPNCIVGKELVNWLIRNGIIIATRAQAIAIGQ 
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3 DOC ID: <WO 0157168A2_I_> 



WO 01/57188 



PCT/US01/03800 



SEQ ID ! 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 1 
^O: of 1 
oeptide 
seq- 
uence 


Viet 
nod 


SEQ 1 
ID NO: 
in 

USSN 
09/496 
914 


Predicted ) 

Deginning 

nucleotide 

ocation 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine OCysteme, 
D=Aspartic Acid, li— uiutamic avciu, 
F=Phenylalanine, G^Glycine, H=Histidine, 
[=1 so leucine, K— Lysine, l— lszw^uv^, 
M^Methionine, N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine 9 V=Valine 5 W-Tryptophan ; 
Y=Tyrosine. X=Unknown 3 *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














AMVDGRWLDCVSHHDQLFRDEYALYKPLQV 
LFSVYC0LECSKL1L 


215 


1565 


A 


2464 


3 


2932 


GPGVRSSQDGMADVFVHLRTAWPRCSF1SGQ 

HGPGRHGRRVCSSQDSMADVFVHLRTAWPT 

CSLISGQHGPGESVSYEDDDIPAPASLLHVNA 

AAPALTNPTAPVLCTAPNNTAQKEKVPSGMR 

QRPAGVRJSSRTPDLTCAVSTHSTVPGVRISSC 

TPDLTCAVSIHSTVPSVCISSCTPDLTCAVSTH 

STVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSIHATVPGVR1SSCTPDLTCAVSIH 

ATVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSIHSTVPGVRISSCTPDLTCAVSIH 

ATVPGVR1SSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSIHATVPGVR1SSRTPDLTCAVS1H 

ATVPGVRISSCTPDLTCAVSIHATVPGVRISSC 

TPDLTC A V S IHATVPG VRI S SRTPDLTC A V SIH 

ATVPGVRISSCTPDLTCAVSTHSTVPGVR1SSR 

TPDLTCAVSIHATVPGVR1SSCTPDLTCAVSTH 

STVPGVRISSRTPDLTCAVS1F1ATVPGVHISSC 

TPDLTCAVSTHSTVPGVR1SSRTPDLTCAVSIH 

STVPGVCISSRTPDLTCAVSIHSTVPSVHISSCT 

PDLTCAVSIHSTVPGVR1SSRTPDLTCAVSTHS 

TVPGVH1SSCTTDLTCAVSIHATVPGVHISSCT 

nni n ^ a \;c!XLTTT\^pr;\/T?mSR TPDLTC AVS1HS 

PDL 1 LAV o i rll 1 VruVKlOijivi rui^ i \^r\ v 

TVPGVRISSCTPDLTCAVSTHSTVPGVR1SSRT 
PDLTCAVSTHLTVPGVRISSRTPDLTC A V SIH A 
TVPGVHISSCTPDLTCAVSIHATVPGVRISSRT 
PDLTCAVSIHATVPGVHISSCTPDLTCAVSTHS 

T VPGVRISoKi rUL l L,/V V olrio 1 v rvj v niojv^ i 
PDLTCAVSTHSTVPGVHISSCTPDLTCAVSTH 
STVPGVHISSRTPDLTCAVSIHATVPSVHISSC 
TPDLTCAVSIHSTVPGLLTSVSQTSTG 


216 


1566 


A 


2477 


1 


414 


FRTKSYRKGSYRCIVSEWIAEQGNWQE1QEK 

A VEVATYVJQr 1 YLKAAVrJVJN v o v f\jD\Ji^i-i--Ls 
LTCNITTDRADDVRPEVTWSFSRMPDSTLPGS 
RVLARLDRDFLVHSSPHVALSHVDARSYHLL 

VRD V blsJlN ~>Lt i i * 


217 


1567 


A 


2480 


2 


460 


CRTLCEGPQRFEEYEYLGYKAGLYEAIADHY 
MQVLVCQHECVRELATRPGRLSPIENFLPLHY 
D YI ,QFA YYRV GEY VK ALECAK A YLLCHPDD 
EDVLDNVDYYESLLDDSIDPASIEAREDLTMF 
a/fuui^i fqpt Tkr^AAPfrl GXSYTEPNYW 


218 


1568 


A 


2483 


140 


383 


" AFSSPHPSPAPQFPECGF YGL YDKILLFKHDP J 
SANLLQLVRSSGD1QEGDLVEWLSASATFED 
LQIRPHAlTVHSYRAP 


219 


1569 


A 


2489 


3 


428 


' S SRLVLL AGAAAL ASGSQGDREP V YRDC VLQ 
CEEQNCSG GALNHFRSRQPIYMSLAGWTCRD 
DCKYECMWVTVGLYLQEGIIKVPQFHGKWP 
FSRFLFFQEPASAVASFLNGLASLVMLCRYRT 
FVPASSPMYHTCVAFAWVS 


220 


1570 


A 


Z4y«s 


i 
i 


1297 


"' MDGEAVRFCTDNQCVSLHPQEVDS V AMAP A 
APKIPRLVQATPAFMAVTLVFSLVTLFVVDH 
HIIFGREAEMRELIQTFKGI-IMENSSAWVVE1Q 
MLKCRVDNVNSQLQVLGDHLGNTNADIQMV 
KGVLKDATTLSLQTQMLRSSLEGTNAEIQRL 
KEDLEKADALTFQTLNFLKSSLENTSIELHVL 
SRGLENANSEIQMLNASLETANTQAQLANSS 
LKNANAE1YVLRGHLDS VNDLRTQNQ VLRN S 
LEGANAEIQGLKENLQNTKALNSQTQAFIKSS 
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MSDOCID <WO 0157188A2_I_> 



WO 01/57188 




'CT/US0 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SCQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SCQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine OCysteine, 
D=Aspartic Acid, E~G lutamic Acid, 
F=Phenylalanine, G=Glycine ? H-=Histidine s 
l=lso leucine, K=Lysine, L=Leucine ? 
M=Methionine 5 N=Asparagine, P-=Proline. 
Q^Glutamine., R^Arginine, S=Serine, 
1 = 1 nreoninc, V=vanne, w— J ryptopnan. 
Y^Tyrosine, X^Unknown : *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














FDNTSAE1QFLRGHLERAGDEIHVLKRDLKM 
VTAQTQKANGRLDQTDTQIQVFKSEMENVN 
1 JLNAQ1Q VLNOriMKNAoKblQI LKl^UMKNA 
SALTS QTQMLDSNLQKASAEIQRLRGDLENT 
KALTMEIQQEQSRJLKTLHVVITSQEQLQRTQ 


221 


1571 


A 


2501 


3 


500 


RVRJLNNDGLSPLMMAAKTGKIGIFQHIIRREV 

TDEDTRIILSRKPKDWAYGPVYSSLYDLSSLD 

TCuEEASVLEILVYNSKIENRHEML 

LLRDKWRKFGAVSFYINVVSYLCAMVIFTLT 

AYYQPLEGTPPYPYRTTVDYLRLAGEVITLFT 

GVLFFFTN 


222 


1572 


A 


2508 


3 


395 


DAHCQRJCLAMQEFMErNERLTELHTQKQKL 

ARHVRDKEEEVDLVMQKv'ESLRQELRRTER 

AKKELEVHTEALAAEASKDRKLREQSEHYSK 

QLENELEGLKQKQISYSPGVCSIEHQQEITKL 

KTDLEKKS 


223 


1573 


A 


2544 


2 


412 


NDPAI1SNFSAAVVHTIVNETLESMTSLEVTK 
M VDERTD YLTKSLKEKTPPFSI ICDQ A VLQC S 
EASSNKDMFADRLSKSIIKHSIDKSKSVIPNID 
KNAVYKESLPVSGEESQLTPEKSPKFPDSQNQ 
LTHCSLSAA 


224 


1574 


A 


2552 


401 


1 


GASLCFISTAFTVLTFLIDSCRFSYPERPIIFLSM 

CYNIYSIAYIVRLTVGRERISCDFEEAAEPVLI 

QEGLKNTGCAIIFLLMYFFGMASSIWWVILTL 

TWFLAAGLKWGHEAIEMHSSYFH1AAWA1PA 

VK 


225 


1575 


A 


2563 


724 


1 


M S ARKERREKGEEEGEGEKDGDEDEKEEEKE 

GLGEEEEKEAGKKKKKQEEKEKEKGAVYSR 

VARICKNDMGGSQRVLE1GIWTSFLKARLNC 

SVPGDSFFYFDVLQSITDIJQINGIPTWGVFTT 

QLNSIPGSAVCAFSMDDIEKVFKGRFKEQKTP 

DS V WTA VPEDK VPKPRPGCCAKHGLAEA YK 

TS1DFPDETLSF1KSHPLMDSAVPPIADEPWFT 

KTRVRYRLTAISVDHSAGPYH 


226 


1576 


A 


2571 


449 


3 


EGVLFVYGNYVGDVMNFEMAAEMAQEVAIP 

TRTVLTTDDISSSPIEDRDGRRGVAGNFFIFKV 

AGAACDRGMSLEACEAVTRKANRJRTYTMG 

VALEPCSLPQTRRHNFEIGAEEMEIGMGIHGE 

RGVIREKMMPADAIVDHIMDR1FS 


227 


1577 


A 


2575 


3 


1197 


VLSDLCLFYYRDEKEEGTLGSILLPSFQIALLTS 

EDHINRKYAFKAAHPNMRTYYFCTDTGKEM 

ELWMBCAMLDAALVQTEPVKRVDKITSENAP 

TK£TNNIPNHRVLIKPEIQNNQKNKEMSKIEE 

KKALEAEKYGFQKI)GQDRPLTKINS VTCLNSL 

PSEYESGSACPAQTVHYRPINLSSSENKIVNVS 

LADLRGGNRPNTGPLYTEADRVIQRTNSMQQ 

LEQ WIKIQK GRGHEEETRG VIS YQTLPRNMPS 

HRAQIMARYPEGYRTLPRNSkTRPESlCSVTP 

STHDKTLGPGAEEKRRSMRDDTMWQLYEW 

QQRQFYNKQSTLPRHSTLSSPKTMVNISDQT 

IVLrioIr 1 or orlvjo JZ-VrV r yU i or V^lvl i lvoc< V obi J 

QRGDVTIDRRHRAHHPKVK 


228 


1578 


A 


2583 


3 


330 


LPFLGLGSVLPQGMVMASPEMNPTICSVFEA 
HIVLLFIiATTFRRGFQVTVLVGNVRQTAVVE 
KJHAKVRGTWPFISPEVRKEGGLPQTGRELLD 
PTMGIKPHLWWVAA 


229 


1579 


A 


2589 


1 


448 


DDKNAQGIKRHVKPTSGNAFTICKYPCGKSR 
ECVAJPNICKCKPGYJGSNCQTALCDPDCKNH 
GKCIKPNICQCLPGHGGATCDEEHCNPPCQH 
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SOOCID: <WO 0157188A2J_> 



WO 01/57188 



PCT/U SO 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
N40: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid. E=GIutamic Acid, 
^Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine> L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W^Tryptophan, 
Y~iyrosinc, /v — L/ufvjiuvvu, oivp vuuvn, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














GGTCLAGNLCTCPY GFVGPRCETM V CNRHC 
ENGGQCLTPDICQCKPGWYGPTCSTA 


230 


1580 


A 


2593 


2 


138 


A V 1 rbVVrAi VAJL/l I K^lZfrl&L\.&iyu\ i v ^mi i 

LYLLYLLRNAFFLR 


231 


1581 


A 


2595 


185 


2 


SGPYTDFTPWPTEEQKLLEQALKTYPVNPPER 
WEKJAEA V ruK 1 KK>vL^liVK i kv /VL/.L*\.-iorv 


232 


1582 


A 


2596 


1 


391 


STVTGQPRRLLDTAGHQQPFLELKIRANEPGA 
GRARRRTr 1 Lbr A 1 r LCL,l\J\Juri i v intv^ili^vj vv 
RDWILLPEGYQLNYCSGQCPTHLAGSPGIAAS 
FHSAVFSLLKANNPWPGRTSWCVFTARRPLS 
LLYL 


233 


1583 


A 


2601 


184 


403 


LLFSDEIIMAAPLRIADVTSGLIGGEDGRVYV 

YNGKETTLGDMTGKCKSWITPCPEEKVNVLQ 

NSIPYWERIT 


234 


1584 


A 


2614 


17S 


335 


' PLTLCLPENNKPPQ ADA VPDKELTLP VDSTTL 
DG SKSSDDQKI1S YLWEKTQ 


235 


1585 


A 


2616 


2 


896 


DVLEVYGTGVASTRHEMGTLDKHKELEDLV 

AKFLN VE AAMV FG M GF ATN SMN I PAL V u Kaj 

CLILRDEVNHTSLVLGARLLGATIGIFKHNYA 

QSLEKLLRDAVTYGQPRTRRAWKK1LILVEGV 

YSMEGS1VHLPQHALKKKYKAYLYIDEAHSI 

GAVGPTGRGVTEFFGLDPHEVDVLMGTFTKS 

FGASGGYIAGRKARILSPPACLVPN ivjbrlbLM 

RLTRDLQMNEAMVALVTDRLQGWNSGEGN 

WDRADKFGDLVDYLRVHSHSAVYASSMSPPI 

AEOHRSLKLIMGLDGTTQ 


236 


1586 


A 


2621 


1 


392 


NTSSFPAQPSSPARPSLPHLSQHPSNPLLPLAS 
ADHPQCGRFLPLHEPEPLCPSPbLbi Lvbb 
WS SPFSSHHGCPPGL YPFPTSPKTIQPPG L AQL 
KMLC1PPGRQQLRGAQSMPGHGALSPLLLPP 
A 


237 


1587 


A 


2628 


39S 


1 


DLVCKISGFGRGPRDRSEAVYTTMSGRSPAL 
W AAPETLQFGHFSSASD VWSFGIIMWEVMAF 
GERPYWDMSGQDVIKAVEDGFRLPPPRMCPN 
LMHRLMLDCWQKDPGERPRFSQIHSILSICN'IV 

QDPEPPNV 


238 


1588 


A 


2631 




1104 


WSPCSLTCGVGLQTRDVFCSHLLSREMNETV 

1LADELCRQPKPSTVQACNRFNCPPAWYPAQ 

WQPCSRTCGGGVQKREVLCKQRMADGSFLE 

LPETFCSASKPACQQACKKDDC1 bbWLLbUW 

TECSTSCGEGTQTRSAICRKMLKTGLSTVVNS 

TLCPPLPFSSS1RPCMLATCARPGRPSTKHSPHI 

AAARKVYIQTRRQRKLHFVGGGFAYLLPFCTA 

VVLRCPARRVRKPL1TWEKDGQHLISSTHVT 

V APFGYLKIHRLKPSDAG VYTCS AGPAREHF 

VIKLIGGNRKLVARPLSPRSEEEVLAGRKGGP 

KEALQTHKHQNGIFSNGSKAEKRGLAANPGS 

RYDDL V SRLLEQG APCS S SKKKN 


239 


1589 


A 


2636 


1 


678 


MKPDN1LLDEHGHVH1TDFN1AAMLPRETQ1T 

TMAGTKP YM APEMFS SRKG AG YSFAVD W W 

SLGVTAYELLRGRRPYI1IRSSTSSKEIVHTFET 

TVVTYPSAWSQEMVSLLKKLLEPNPDQRFSQ 

LSDVQNFPYMNDINWDAVFQKRLIPGFIPNK 

GRLNCDPTFELEEMILESKPLHKJCKKRLAKK 

EKDMRKCDSSQTCLLQEHLDSVQKEFniNRE 

KVNRDC1 


240 


1590 


A 


2639 


389 


3 


- " ELLDPTTPMRTKClELLYAAL'itJSSTDQPKAD 
LWQNFAREIEEHVFTLYSKNIKKYKTCIRSKV 
ANLKNPRNSHLQQNLLSGTTSPREFAEMTVM 
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MSOOCID- <WO 01571B8A2_I_> 



WO 01/57188 




'CT7US01/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: Oi 
peptide 
seq- 
uence 


Met 
nod 


SEQ 
1L> NU: 
in 

UOOlN 

09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutarnic Acid, 
F=Phenylalanine. G=Glycine, H=Histidine, 
I^Isoleucine, K— Lysine, L=Leucine, 
M^Methionine, N=Asparagine, P=Proline, 
Q=GJutamine ? R=Arginine, S=Serine, 
T=Threonine, V=V aline, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \ = possiblc 
nucleotide insertion 














EMANKELKQLRASYTESCIQEHYLPQVIDGTL 
Y 


241 


1591 


A 


2640 


392 


3 


IRLTILRCVFMRLATICVLVFTLGSKITSCDDD 
1 LULLuYNQkLi rCWblQVuQbMYKJ^MlrD 
FI1ILAVTLF\ / DFPRKLLVTYCSSCKJLIQCWGQ 
QEFAIPDNVLGIVYGQTJCW1GAFFSPLLPAM 
Y 


242 


1592 


A 


2642 


405 


1 


YFKNTTLLLVGVICVAAAVEK\VNLHKRJALR 
MVLM.^UAKrvjMLLLtrlVlLC rTLLSMWLSNT 
STTAMVMPIVEAVLQELVSAEDEQLVAGNSN 
TEEAEPISLDVKNSQPSVEL1FVNEDILDFLMK 
SPLMISQACI 


243 


1593 


A 


2646 


412 


2 


CLAMIKGIQSSGKIIYFSSLFPYVVLICFLIRAF 
LLNGSIDGIRHMFTPKLEIMLEPKVWREAATQ 
VFFALGLGFGGVIAFSSYNKRDNNCHFDAVL 
V SFrNFKI'S VLATLV VFA VLGFKAN V 1NEKC1T 
QNSETV 


244 


1594 


A 


2650 


1 


1271 


MTTTLIGLLKTARLLRLVRVARKLDRYSEYG 

AAVLMLLMCIFALIAHWLACIWYAIGNVERP 

YLTDKIGWLDSLGQQIGKRYNDSDSSSGPSIK 

DKYVTALYFTFSSLTSVGFGNVSPNTNSEKIF 

SICVML1GSLMYASIFGNVSAIIQRLYSGTARY 

HMQMLRVKEFIRFHQIPNPLRQRLEEYFQHA 

WTYTNGIDMNMVTNGTCSSCTSDDGHFILVS 

NHHQGGLIYS^DAASMQRPFNHIKSSLLGS 

TSDSNLNKYSTINKIPQLTLNFSEVKTEKKNSS 

PPSSDKTIIAPKVKDRTHNVTEKVTQVLSLGA 

D VLPE YKLQ APRJTNKF 1 1LHYSPPKAVWDWLI 

LLLVIYTAIFTPYSAAFLLNDREEQKRRECGY 

SCSPLNVVDLIVDIMFIIDILINFRTTYVNQNEE 

VVSDPASV 






A 


2656 


385 




NLTWWPLFIUjVSFYIVDLIKILIIFFLDNVIMW 
WESLLLLTAYFCYVVFMKFNVQVEKWVKQ 
MINRNKVVKVTAPEAQAKPSAARDKDEPTJLP 
AKPKLv^KOO bbAb LH N b LMKN bit 4 QNKI HTLD 
PHV 


246 


1596 


A 


2660 


200 


506 


VLVLQMNYYQMLIIYYVLFFKVNEFLAFEGPI 
LLDMRIKHLIKTNQLSQATALAICLCSDHPEIG 
1KGSFKQTYLVCLCTSSPNGKLIEEVSMFSF1S 

XTVT7I C 

In YrLj 


247 


1597 


A 


2678 


3 


267 


DAWVKNDIIFNQTERKQKISENLKHLASVRV 
VQKNLVFVVGLSQRLADPEVSPLVFFVILIFF 
VSLSYLEirFDPAQLCDSSEHIIS 


248 


1598 


A 


2687 


1 


404 


DFTTLAAMMRTLFSLFGDVRSDVHRFSVTLF 

/"» A A TI'C\/VXrDrM^l/C1D\TA\/T T\CI WT>I r I WO/"\r\ 

OAAlKb VKNPUJ^^MbNQVJLUbLVFLLLYSQD 
ENDAVAEESRQVLTICAQFLKWKLPREVYSK 
DPWHIKPTEAGTICRFFEKKCKGKINILEQTL 
MYSKNPKL 


249 


1599 


A 


2692 


1 


440 


FRRRRRRRERDCAAQGARRHCRHLAECKXV 
SFPIGIYKYLRNVSGQIHLITLANNELKSLTSK. 

AIDLSRNQFQDFPEQLTALPALETINLEENEIV 
D VP VEKL AAMPALRSIN L 


250 


1600 


A 


2693 


459 


21 


LLPGSJLGVP1LHSQPWDPSPQCPHRAPSTPRRL 
PPLGALSQALTFLSRAAKNHSQDPGKGTKPFP 
AAPAAPPPRSSLPAPLPMGLKDKGPQPAPPTIF 
NSPWHPATLPGALGPQLSQAAPSPIPPPCLMG 
ISSCPDLKLTKSSTP 


251 


1601 


A ! 


2694 


2 


404 


FVFDLKLRVPGFAALLIHGASSVPGPETVRLR 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D-Aspartic Acid, E=Glutamic Acid, 
F=Phenylatanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, 1 .^Leucine, 
M=Methionine, N=Asparagine, P=Proline ? 
Q=Glutamine s R=Arginine, S^Senne, 
T=Threonine, V-Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon. 
/-possible nucleotide deletion, \=possible 
nucleotide insertion 














QKRKJCKAPDHSSGKJsJiiiLV 1 1 ill v uivl,e, i isjs. 
PVGRVLCGLSGELLHSLLLPRRKTEKRALGSH 
RKAGFPEHPVAPEPLSNSCQISKEGREQVLSEI 
GAGDCL 


252 


1602 


A 


2697 


421 


1 


PQKSHSGAYQCFATRKAQTAQJJrAJlAi.iiUU 
TPR1VSSFSEKWNPGEQFSLMCAAKGAPPPT 
VTWALDDEPl Vl<JJvjc>Hivl Ni^ * iiviii^vji 11011 
MNVTGPQIRDGGV YRCTARNLVG SAEYQAR1 
NVRGPPSIRAMRN1T 


253 


1603 


A 


2698 


65 


401 


" ACCQWRRTLIPAKSTTVSCTISTPHHPFRGSYS 
FDDHITDSEALSRSSHVF'1'SHPRMLICRQPAIEL 
PLGGEYSSDVPRPLSTQLSSSLLGYFSTLMTG 
AAFTNNIAS STI1L 


254 


1604 


A 


2699 


438 


301 


"GQIHSQDDPPFIDQLGFGVAPGFQTFVACQEQ 
RVRGPWEAGPGVGY 


255 


1605 


A 


2700 


1 


842 


LQNRJEDSSEGIRlU^VliAtELEEkKRbAQ v b 

AQHLE VHLKQKEQHYEEKIK VLDN QIKKDL A 

DKETLENMMQRHEEEAHEKGKILSEQKAMIN 

AMDSKIRSLEQRIVELSEANKLAANSSLFTQR 

NMKAQEEMISELRQQKFYLETQAGKLEAQN 

RKLEEQLEKISHQDHSDKNRLLhLli 1 KLKfc v b 

LEHEEQKLELKRQLTELQLSLQERESQLTALQ 

AARAALESQLRQAKTELEETTAEAEEEIQALT 

VGLGSNIFRLLKASARMSVELALSILAI-IP 


256 


1606 


A 


2701 


2 


405 


FVGGPGADPPVAVMWDPRAARMDLTAYAE 
LLKESGNQVLKNGNFSLAIRKYDEAIQILbqL 
YQWGVPPRDLAVLLCNKSNAFFSLGKWNEA 
FVAAKJECLQWDPTYVKGYYRAGYSLLRLHQ 
PYEAARMFFEGLR 


257 


1607 


A 


2702 


2 


399 


FVESASSRPPGCFSGDGRFWLVSEGSRRGWD 
FNP SF SFLDPRY S VGGDENIGTVTTL ANILREF 
NPSLKGFSVGTGKETSPNAFLNQAVAGGRAE 
DLPVQARRLVDLMKNDTRIHFQEDWKUTLFI 
GGNDL 


258 


1608 


A 


2709 


1 


1097 


SVGARQGEARDR1RREFPKGDLEVLQAQVERI 

MTRKELLTVYS SEDGSEEFETIVLKAL VKACG 

SSEASAYLDELRLAVAWNRVDIAQSELFRGDI 

Q WRSFHLEASLMD ALLNDRPEF VRLLlbl-iU Lb 

LGHFLTPMRLAQLYSAAPSNSLIRNLLDQASH 

SAGTKAPALKGGAAELRPPDVGHVLRMLLG 

KMCAPRYPSGGAWDPHPGQGFGESMYLLSD 

KATSPLSLDAGLGQAPWSDLLLWALLLNRA 

QMAM YF W EMG S In A V b b ALAj AC L, I J l. ic v ivi/\k 

LEPDAEEAARRKDLAFKFEGMGVDLFGECYR 

SSEVRAARLLLRRCPLWGDATCLQLAMQAD 

ARAFFAQDGVQSLPTQKWWGDMARR 


259 


1609 


A 


2721 


1 


403 


" VYLGAGPGLFFSNEGAKEGEKANIPKLMLPR 
GGFSQREMVTGERSPSPEEEEEEEEEGFGERA 
SCRRGLFRVTU/TRVGLAAPSKASRGQbuDAA 
PKSPVREKSPKFRFPRVSLSPKARSGSGDQEE 
GGLRVRLP 


260 


1610 


A 


2728 


1 


477 


LLGGDLRYHLQQNVHFTEGTVKLY1CELALA 
LEYLQRYHIIHRDIKPDNILLDEHGHVHITDFN 
IATVVKGAERASSMAGTKPYMAPEVFQVYM 
DRGPGY S YP VDW WSLGITA YELLRGWRPYEI 
HS VTPIDEILNMFKVERVHY SST WCKGM VAL 
1 ,RK. 


261 


1611 


A 


2730 


3 


547 


* LTITDFILVLYRYYRSPLVQIYEIEQHKIETWR 
EIYLQGCFKPLVSISPNDSLFEAVYTLIKNRJH 
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SEQ ID 

IMfV of 

nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO- nf 

peptide 
seq- 
uence 


Met 
nuu 


SEQ 

IU INvJ. 

in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 

IIt£ IU Alibi 

amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 

duiu icmuuc 

of peptide 
sequence 


Amino acid sequence (A=Afanine C=Cysteine ; 
D-Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isolencine, K=Lysine, L-JLeucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine_, R~Arginine, S~Serine, 
T-Threoninc, V=VaIine. W=Tryptophan, 
Y=Tyrosine, X=Unknovvn, *=Stop codon, 
/=possib!e nucleotide deletion, \=possible 
nucleotide insertion 














KLrvLur VbON VLH1L JHlvKLLKr LHJrGSLLP 
RPSFLYRTIQDLGIGTFRDLAVVLETAPILTAL 
DIF VDRR V S ALA V VNECGTHPQDERLGLG \V 
GLGEPGSEERLFPAAITSR 


262 


1612 


A 


2733 


j 


A"X 1 
HJ 1 


f""513'E"PPf*~lC A VI WITT l"\r CVMMf TAI f y A A T^Tl C A 

Ol^crrOoAJSJL Vr.LL)Li> i NInL J l^l^o A(jr Ar Kb A 

GRLVKLSLANNNLVGVHEDAFETLESLQVLE 
LNDNNLRSLSVAALAALPALRSLRLDGNPWL 

ESRRISLRACRRPASRV 


263 


1613 


A 


2736 


2 


343 


PARISGVDPPVRKATKGGENCSFEDNKNWQF 
L WGLNGN FNFFKEP WGGRNNHAKGFRTT W 
ARSSSQNNRTFQNNRNFLRLQRDSQKKGQFA 


264 


1614 


A 


273S 


2 


245 


RAMLKCLREG QPPPS YN WTRLDGPLPSGVR V 
DGDTLGFPPLTTEHSGIYVRHDTNEFSSRDSH 


265 


1615 


A 


2752 


2 


388 


AAGDAPLRSLEQANRTRFPFFSDVKGDHRLV 
LAAVETTVLVLIFAVSLLGNVCALVLVARRR 

DD^ ATA /"""I \/T XII A T^T T 1?TC A TT>¥ 171 A \ 7™r» \i r-i- 

KKvj A 1 AtLV LIN JLr C ADJLLr 1SALPLVLA VR WT 
EAWLLGPVACHLLFYVMTLSGSVTILTLA.AV 
SLER 


266 


1616 


A 


2755 


192 


1 


AFREVGGYWGLLCEHLYAIPSKTSEGNWTAK 
LQGYLPLQDAFHIFQDPLTGDLPWPELILGLP 

V 


267 


1617 


A 


2760 


434 


714 


ASRLEKQNSTPESDYDNTPNDMEPDGMGYM 
HRTSVPGEGLPRARDLAGLGQQKQFTTHTPF 
Yri^ I HKULl<JJoMK.£>fc,V J CLOloQC WRKGFF 


268 


1618 


A 


2762 


1 


405 


IACTFCGQDEWSPERSTRCFRRRSRFLAWGEP 

AVLLLLLLLSLALGLVLAALGLFVHHRDSPL 

VQASGGPLACFGLVCLGLVCLSVLLFPGQPSP 

ARCLAQQPLSHLPLTGCLSTLFLQAAEIFVESE 

LPLSWAE 


269 


1619 


A 


9777 


-2 
J 




TDD A Cff AVI \/T TTCA FK ^CUDCA A V/T\r-' I Pt nnrri 

1 KJ^AbKlQYLVLrr VMbHrSQAYDKLSLSDHL 
LIAVLNLLRREVSEHGRHLQQYFNLFVMYAN 
LSKNLSFSEFCFDVSY 


270 


1620 


A 


2789 


1 


486 


ELQSQQACTHTKETEQLRSQLQTLKQQHQQA 

VEQ1AK.AEETHSSLSQELQARLQTVTREKEEL 

LQLSIERGKVLQNKQAEICQLEEKLEIANEDR 

xsJ-1 AJ-liKr t^bAVAV L; biN LK VRLLQRK VDGIQ 

KAYDELRLQSEAFKKHSLDLLSKERELNGKL 

RHLSP 


271 


1621 


A 


2795 


1 


568 


KEKRV 1 VQLPFES1QKNQEDKX1CMVPRKQRE 

FSGSDRGKLPGSEEKNQGPSM1GRKEERLITE 

RKHEHLKNKSAPKVVKQICVIDAIiLDSQTQN 

PAnTAT^TA CCr A CUfl/I DADVATC7 i^im^/~T r-» 

r J v^v ' AtDKAJ^rlisjK.Lr(^r YNbLQbEKCLE 
VKGIQEKQVFSNTKDSKQEITQNKSFFSSVKE 
SQRDDGKGALNIVEFLRKREELHQILSTVKQP 


272 


1622 


A 


2797 


8 


523 


KCMQGKYAGAMESEPCVCTEADFDCDYGYE 
IvhloNOVsJULl Ar Wr NPooLbKJJCbLOQo YLNST 
GYRKVVSNNCTDGVREQYTAKPQKCPGKAP 
RGLRIVTADGKLTAEOGHNVTI MVOT FFnn 
VQRTLIQVDFGDGIAVSYVNLSSMEDGIXHV 
YQNXG1XRXTVQVDNSLGS 


273 


1623 


A 


2S01 


72 


395 


HPSRSNVGPRQLTV\WTSNLSHDNRRKYIFS 
DEEG QNQLGIRIHQDIPLPPRRRELPALRTTNG 
KADSLNVSRNSVMQELSELEKQ1QVIRQELQI. 
AVSRKTELEEYH 


274 


1624 


A 


2805 


168 


320 


ILWLYFETGTWVYPVFAKLSLLGLAALFSLRE 
I FLARN G V V GETLTHCKR V 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


^mino acid sequence (A=Alamne C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G^Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine 3 N=Asparagine, P=Proline, 
Q-Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, W^Tryptophan, 
Y=Tyrosine, X=Unknown 3 *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 


275 


1625 


A 


2812 


208 


321 


GSLATCQLSEPLLWFILRVLDTSDALKAFHD 
MGKIIFQ 


276 


1626 


A 


2813 


41 


266 


AGRSLHGAGDRAWVGISPTDWSPKVVELCK 
KYQQQTVVAIDLAGDET1PGSSLLPGHVQAY 
QVGPVRRN GE AGPG 


277 


1627 


A 


2817 


3 


410 


VLQERLDNFQRKCIQLASSTEGKVDKLLMRN 
LFISYLHTPKHKQHEVLQAMGSILGITGEEME 
PLFQEEHGTATRWMTGWLEGGSKSVPKTPL 

«. T — . An . » -k !/>ornri r\;i/Cl t 7 TCCI CCT1 PTY 

GLNQQP AL.NGSFSEI -FVKFLK I bi»Lbb lLf ia 
LPPHNSPG1CIK 


278 


1628 


A 


2821 


238 


457 


GLSGPSCSCPHSPLPT1ISRAQLETALKWRN Yfc 
VKLRLLLHLEELQMEI lUiKli Y DLkb v rm jwu 
PVDQNPRLV 


279 


1629 


A 


2822 


342 


3 


PLIPANLPAHSNPLQPLPSLPHPFLPATHKFP F 
TPPTFSSVPPPLPSLSSILHHSPLHSELNPHLQS 
CRLPSRPSVSRELPPQSOPAbb v VL,/\r i rLfus 
VPSQRHPTXPPPAS 


280 


1630 


A 


2825 


307 


77 


PSMVWSYHWGVKQKRLALCVFSFEEGGRRK 

CGQYWPLEKDSRIRFGFLTVTNLTGAVGEPG 

VAFQCDGQRRREPTC 


281 


1631 


A 


2827 


81 


381 


KMGTAVWVPKEKEKRDKASQEGGDVLGAR 
QDCTPSLKSLVATGNLLDLEETAKAPLSTVSA 
NTTNMDEVPRPQALSGSSWWVSGCVASRS 
VILSLTSG 


282 


1632 


A 


2S30 


471 


160 


KLPXDKYELEPSPLTQYILERKSPHTCWQVFV 
TSSGKYNELGYPFGYLKASTTLTCVNLFVMP 
YNYPVLLPLLDDLFKVHKLKPNLKWRQAFDS 

YLKTLPPYYL 


283 


1633 


A 


2835 


462 


148 


VSPALSLTPTIFSYSPSPULbPh 1 SSSUhSFNrbb 
MKHYI *HSQ ACS VFNYHLSPRTFPRYPGLM VP 
PLQCQMHPEESTQFS1KLQPPPVGRKNRERVE 

SSEESAP 


284 


1634 


A 


2836 


2 


384 


KTLPRTLLDILADGTILKVGVGCSEDASKLLQ 
DYGLVVRGCLDLRYLAMRQRNNLLCNGLSL 
KSLAETVENFPLDKSLLLRCSNWDAETLTED 
Q VIY AARD AQI S V ALFLHLLG YPFSRNSPGEK 
KR 


285 


1635 


A 


2843 


20 


271 

27S 


PIRPYYS Y SGLDRDC b W LrL AKA w l,i u v ivuLv 
VCDRVSEDGINRQQAQEWCIKHGFELVELSP 
EELPEEDGKCLCVRRKYGTYI 
TAEDVLTVAYEHG VNLFDTAE V Y AAGK 


286 
287 


1636 
1637 


A 
A 


2845 
2851 


197 
2 


427 


" FVAEVRREWAKYMEVHEKASFTNSELHRAM 
NLHVGNLRLLSGPLDQVRAALP 1 rAJ^brrvUrv 
AVLQNLKR1LAKVQEMRDQRVSLEQQLRELI 
QKI5DITGSLVTTDHSQMKKLFEEQLKKYDQL 
KVYLEQNLAAQDRVLCALT 


288 


1638 


A 


2859 


2 


469 


F VNLGILTCIECSGIHREMGAHI SRI QSLELDK 

LGTSELLPAKHVGNNSFNDlMEANLPbrbPIJvr 

TPSSDMTVRKEYITAKYVDHRFSRKTCSTSSA 

KLNELLEA1KSRDLLAL1QVYAEGVELMEPLL 

EPGQELAETALHLAVRTADQTSLHLVE 


289 


1639 


A 


2861 


2 


454 


" FVASGGPATARMSDSQFFCVAEERSGHCAW 
DGNFLYVWGGYVSIEDNEVYLPNDEIWTYDI 
DSGLWRMHLMEGELPASMSGSCGACINGKL 
YIFGGYDDKGYSNRLYFVNLRTRDETY1WEK 
ITDFEGQPPTPRDKLSCWVYKDRLIYFG 


290 


1640 


A 


2868 


I 


378 


; FRQGQLYKVFLHGSQGQVYHSQQVGPPGSA1 
SPDLLLDSSGSHLYVLTAHQVDRIPVAACPQF 
PDCASCLQAQDPLCGWCVLQGRCTRJCGQCG 
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SEQID 

NO. Ot 

nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: or 
peptide 
seq- 
uence 


Met 
nod 


SEQ 
ID piU. 
in 

09/496 
914 


Predicted 
beginning 
nucleotide 

1 r\ \ f~\r\ 

lOUdllUIl 

correspond i 
ng to first 
amino acid 
residue of 

pcpnuc 

sequence 


Predicted end 
nucleotide 
location 
correspond i ng 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alaninc C=Cysteine, 
D=Aspartic Acid, E=Glutam ic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
1— I so leu cine, K=Lysine, L=Leucine s 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine 9 
T=Threoninc, V=Valinc, W=Tryptophan, 
Y=Tyrosinc, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, N^possible 
nucleotide insertion 














T2 Anew Tvir^wn urc\n?cncur , T uinci i d^iijixdd 
QE 


291 


1641 


A 


2870 


1 


385 


FRYMPNNRQQLLRKRHIGNDIVTIVFQEPGAL 
PFTPKSIRSHFQHVFVJVKVHNPCTENVCYSV 
GVSRSKDWPFGPPIPKG\OTPKSAVnU>FLL 

LA 




1 64-? 

i 0*T— 


A 




-i 
o 


1 

1 oo 


T? PT1? PPP A TTHQ PPQTK/TTkTQ 1 VVPVQATT TkT "VI 

ivr l lvrr r A. l l ^orto 1 1VUJ 1 bLl\.ISJtlvoAILlJLr Y 1 
PPPPAVPYSPRYVAVHCHGMLVSCWCH1 


293 


1643 


A 


2878 


1 


427 


REKEEE VEEEEDK V VKETEKEAEQEKEEDSL 
GAGTHPDAAIPSGERTCGSEGSRSVLDLVNYF 

GPCYLILTLLRFSFDLRTMRRRKILDDVSIPLL 

T PT PT Ar,m?r,f^AVTM 


294 


1644 


A 


2879 


109 


245 


QLCCFCFRQTTLIVYILSFIGMVIFTFTLDLRYI 
IIVFVTGGVLG 


295 


1645 


A 


2880 


3 


320 


LASSQHG1LNNLSLLFSICKTCIRTMDHHCPRA 
NNCVGEQNHRFFCALHCKSKHFCIEFTLNTNF 
riMLr LroAbKb I lUAr r SLQrr LQDSIv YNTALS 
LSESTSQ 






A 

A 








bHbLDYHLLQV 1 KNPrl LGDSSNPGQTE 
RLQEFSQKMDQVRGHWPVST 


297 


1647 


A 


2893 


8 


424 


SPXTLXLDTFILLGIQDNILVLILATPPFMAGG 
KLYSTMGRFLRDRKNPACREMAVVLLANLA 
QGDSLAARAIAVQKGSIGHLLGFLEDSLAAT 
Q1QQSQASLLHMHNPPFEPTSVDMMRRACRA 
L»L» AL Atv V DUN H b.br 


298 


1648 


A 


2894 


310 


445 


FWIYFPSFFMTGYLPLGFEFAVEITYPESEGTS 
SGLLNASAQVNL 


299 


1649 


A 


2898 


1 


492 


KIKAKNLTNYDLCSIFLGTSTLLVWVGVIRYL 
G YFQA YN VLILTMQ ASLPK VLRFC AC AGMI Y 
LGY IrCGWlVLGPYHDKFENLNTVAECLFSL 
VNGDDMFATFAQIQQKSILVWLFSRLYLYSFI 
SLFIYMILSLFIALITDSYDTIKKFQQNGFPETD 


300 


1650 


A 


2901 


I 


445 


P V W WNSLNG ASE VTF S VH VKDGG SFPKTDST 
TVTVRPVNKADFPKVRAKEQTFMFPENQPVS 
SLVTTITGSSLRGEPMSYY1ASGNLGNTFQIDQ 
LTGQVSISQPLDFEKIQKYWWIEARDGGVPP 

PQC VP ITT r^TT\/T F»\/"MrV)\J A PT"P 


301 


1651 


A 


2902 


162 


433 


THFICLPLG YCFPLLDKDLQLP SG FN CNFDFLE 
Jtr CO W iVl i DrlAls. VV LK 1 1 WAoboiJriNJL>Kl rru 

KPAVSEDMKELRPACSTYFNPRFPYKL 


302 


1 

1 OJ- 


A 


7000 






urVdlviLUJvJM i Jr 1 W V llvo^O^rilWJLAlJlMl^JbV 
EENDHQDLVSVHIYVTQLAEKFDLRTTMLYI 
CERHFQKVLNRSLFTGLRSITHFGRPPFEPFFN 
SLQEVHPQVKJCIGVFSCGPPGMTKNVEKACQ 
7 \mpnr>p AHPU 


303 


1653 


A 


2914 


291 


453 


KLNRWLCFFYSWSFGILLYEMVTLGAPPYPE 
VPPTSILEHLQRRKIMKRPSSCS 


304 


1654 


A 


2926 


179 


354 


NKDVQREIADLGEVGAASLPPSSGPGA 


305 


1655 


A 


2938 


135 


438 


GMGYLIIAKGILI^KDLKSKKSTYDNGKVVIT 
DFGLFSISGVLQAGRREDKLRIQNGWLCHLA 
PEIIRQLSPDTEEDKLPFSKFISDVFALGTIWYE 
LHAREWP 


306 


1656 


A 


2944 

1 


2 


329 


VRWNSCVNCSCAFGNGASLSTSLGESSGCLW 
EIGKWLSCSLLSFPSPLAVLIITFCIVTVLGREA 
LTKGALWAVFLLAGSALLCAEVTGVIWRQPE 
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SEQ ID 5 
NO: of I 
nuci- F 
eotide 5 
seq- i 
uence 


>EQ ID I 
vIO: of 1 
>eptide 
;eq- 
jence 


vlet i 
lod 1 
i 
1 
< 
< 


5EQ I 

D NU. 1 

n J 

JSSN 1 
)9/496 < 
?14 i 


'redictcd 1 
>eginning i 
nucleotide 1 
ocation « 
;orrespondi i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end / 

mr^fVitide J 

ocation 1 
:orresponding 
o last amino I 
acid residue 1 
of peptide 
sequence 


\jnino acid sequence (A=Alamne C=Cysteine s 
}=Aspartic Acid, E^Glutamic Acid, 
^Phenylalanine, G=Glycine, H=Histidine, 
^Isoleucine, K=Lysine, L=Leucine, 
^Methionine, N=Asparagine, P-=Proline, 
3=Glutamine, R-Arginine, S=Serine, 
r=Thrconine, V=Valine, W=Tryptophan 5 
Y=Tyrosine s X-Unknown, ^-Mop coaon, 
possible nucleotide deletion, \=possible 
nucleotide insertion 
SKTKLSFKVSSSA 


307 


1657 


A 


2950 


2 


411 


NYLCIAKNSAGSAMGKTRLWQVPPVIENGL 
PDLSTTEGSHAFLPCKAKObt'triNii wujsj^ov 
PVSGAEGKFTIQPSGELLVKNLEGQDAGTYT 
(^AENAVGRAi^KKVrlb I li-, v i^r vr i i Lrui^ivo 
LRLGDRLWLR 


308 


1658 


A 


2951 


1 


407 


PTRPPRVRFDNEFDAESQRKRTTS V SKMERM 
DSSLPEEEEDEDKEAINGSGNAENRERHSESS 
DWMKTVPSYNQTNSSMDFRNYMMRDETLEP 
LPKNWEMAYTDTGMlYrll^rlN i M i i WLUr 
RJ CKKAKAPEDC 


309 


1659 


A 


2954 


2 


179 


QDFLTLTLTEPTGLLYVGAREALFAFSMEALE 
LOGAVRGGAVGGSRACQRARPRGAVLG 


310 


1660 


A 


2959 


1 


419 


QDMMERAIIDTFVGHDV VEPGSYVQMFF Y PU 
YTRDDFLFV1EHMMPLCMV1SWVYSVAMTIQ 
HIV AEKEHRLKEVMKTMGLNN A V HwvAWfi 
TGFVQLSISVTALTAILKYGQVLMHSHVVIIW 
LFLAVYAVATIMFCF 


311 


1661 


A 


2963 


3 


465 


MKPQMPGLGAPNGYGPGRGRAGVPGGFbKK. 

PWVPHLLPFSSPGYLGVMKAQKPGAGEGMK 

pQKPGLRGTLKPQKSGHGHbNOrwi'orojNA 

RVAPMLLPRLPTPGVPSDKEGGWGLKSQPPS 

AVQNGKLPGHQPPNGYGPGAEPGFNGGLEPQ 

KJ 


J 1^ 


1662 


A 


2967 


3 


405 


WLAQEWSPCTVTCGQGLRYRWLC1UHKUM 
HTGGCSPKTKPHIKEECIVPTPCYi^lsJiisa^r v 
EAKLPWFKQAQELEEGAAVSEEPSFIPEAWS 
ACTVTCGVGTQVRIVRCQVLLSFSQSVADLPI 

DECEGPKPA 


■D 1 J 


1663 


A 


2969 


2 


430 


VVADNCRQGYLDALRFLERRGLTKKPVLW l 
LVSKEPPAPADGNWDAGCDQRRK.ljrULdi-iN w 
KVPH V Q VKD VPNFEQLSPELEAALKKACTRD 
PSRWARFWHSGPGQVLTYLLLPCTLPFEYIYF 
RSRRLVVWLPDVPADLWWMQ 


314 


1664 


A 


2971 


422 


33 


LDXSHNALQRLRPGWLAPLFQLRALHLDHNb 
LDALGRGVFVNASGLRLLDLSSNTLRALGRH 
DLDGLG ALEKLLLFTWRLVHLUbHAt' HULio\ 
LSHLYLGCNELASFSFDHLHGLSATHLLTLDL 


315 


1665 


A 


2973 


1 


525 


" ITVSTHASGSPFGLEPQSGWLWVRAALDKbA 
QELYILKVMAVSGSKAELGQQTGTATVRVS1 
LNQNEHSPRLSEDPTFLAV ALM^rro l b vuri^v 
FATDRDSGPNGRLTYSLQQLSEDSKAFRIHPQ 
TGEVTTLQTLDREQQSSYQLLVQVQDGGSPP 
RSTTGTVHVAVLDLNDNT 


316 


1666 


A 


2978 


2 


400 


ELVVELVSAGKSGPERNTYEVQWTGNVPKA 
GTDANVYLTIYGEEYGU 1 UtKri^lvivbUi^iNrv 
FEQG QTDTFTIY A1DLG ALTKiRIRHDNTGNR 
AGWFLDRIDITDMNNE1TYYFPCQRWLAVEE 
pnCQT.SRE 


317 


loo/ 


A 
J\ 


2981 


3 


440 


" VLNCQGRPTRPVRINGDGQEVLYLAESDNVR 
LGCPYVLDPDDYGPNGLDIEWMQVNSNPAH 
HRENVFLSYQUKRINHGSLPHLQHRVRFAAS 
DPSQYDASINLMNLQVSDTATYECRVKKTTM 
ATRKVIVTVQARPAVPMCWTEGQ 


318 
| 319 


1668 


A 


2995 


119 


414 

332 


LPEKEFPIIRKSSSLKVTKCLFTEQPKPIIILRFA 

ENYDARLbRIDIANTLREQVQELFNKTYGKQ 
RRTPGEGHV AAVDREVAGFP VPAEG1 SGET1H 
GFFAYTYGRLV V VLDLHSGAQQHWSGHSAE1 




1669 


A 


2999 


2 
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SEQ ED 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alaninc C=C}'steine, 
D=Aspartic Acid, E— Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V= Valine, W=Tryptophan, 
Y=Tyrosine, X=Unkno\vn, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














STLALSHSAQVLASASGRSSTTAHCQIRVWD 
VSGGLCQHLIFPHSTTVLALAFSPDDRLLVTL 
GDHDGRTLALWGTGHL 


320 


1670 


A 


3000 


693 


322 


IDESTGLI1TVNYLDYETKTSYMMNVSATDQA 
PPFNQGFCSVYITLLNELDEAVQFSNASYEAA 
ILENLALGTEIVRVQAYSIDNLNQITYRFDAY 
TSTQ AKALFKIDAIT VRG WGQG APFFPI 


321 


1671 


A 


3001 


6 


383 


RIPRGKACXTVLGRSTGELEGFASSRLPPQPC 
GWGQSSDLLSRIDLDELMKKDEPPLDFPDTLE 
GFEYAFNEKGQLRHIKTGEPFVFNYREHLHR 
WNQKRYEALGE1ITKYVYELLE1CDCNSICKVS 


322 


1672 


A 


3007 


192 


447 


ERVRNSLFPGRGDSQCACCPSSPVWVFLETGF 

LFPWLFLQVEVIKKAYMQGEVEFEDGENGK 

DGAASPRNVGHNIYILAHQLARH 


323 


1673 


A 


3019 


18 


245 


KELLFYHLIV>TNI7^JFFNTRYAKIHIPnASVSEH 
QP IT WVSFFFDLH 1L VCTFPAGL WFCIKNIND 
ERVFGKRGF 


324 


1674 


A 


3020 


523 


797 


LCYFSARYHQRKIFGILYIFTLSAINRKEPNLFI 
YLFIFFEMESHSVTHAGVQRHNLNSLQPLPPG 
FKRFSCLCFLSSWNYRGAPPGPANF 


325 


1675 


A 


3022 


2 


156 


NDFLPL YFG WVLTKKS SETLRKAGQ VFLEEL 
GNHKAFKKELRQCRWQVGAJL 


326 


1676 


A 


3023 


38 


172 


ICMVRGSKKLISFFPGGPYGILAGRDPSKGLAT 
FCLNKEALKDEFE 


327 


1677 


A 


3027 


I 


385 


LTLEFLLLPAASELAHGKRLACCIVDHKLPEC 
GFYGLYDKILLFKHDPTSANLLQLVRSSGDIQ 
EGDLVEVVLSASATFEDFQIRPHALTVHSYRA 
PAFCDHCGEMLFGLVRQGLKCDGCGLNYI-IK 
RC 


328 


1678 


A 


3030 


13 


569 


n^RPTISCQRPGPGLAAGMLPYTVNFKVSART 

LTGALNAHNKAAVDWGWQGLIAYGCHSLV 

VVIDSITAQTLQVLEKHKADVVKVKWAREN 

YHHNIGSPYCLRLASADVNGKIIVWDVAAGV 

AQCEIQEHAICPIQDVQWLWNQDASRDLLLAI 

HPPNYIVLWNADTGTKLWKKSYADNILSFSF 

D 


329 


1679 


A 


3038 


90 


744 


SVNLPPSLWPWEEAMDSTKSEPLKGSPEAED 

GNIEYKKLVNPSQYRFEHLVTQMKWRLQEG 

RGEAVYQIGVEDNGLLVGLAEEEMRASLKTL 

HRMAEKVGADITVLREREVDYDSDMPRKITE 

VLVRKVPDNQQFLDLRVAVLGNVDSGKSTL 

LGVLTQGELDNGRGRARLNLFRHLHEIQSGR 

TSSISFEILGFNSKGEVHGINGTQWGQTLRMG 

W 


330 


1680 


A 


3040 


3 


397 


LCSTLLLLTIPSWVLSQITLKESGPTLMICPTET 
LTLTCTFSGFSLNTSGVGVAWIRQPPGKALE 
WLALIYWDDDKRYSPSLNDRLTIAJCDTSRNQ 
VVT^TMTNMGPVDTATYY CAQFARGARGSN 
WFDPWGQ 


331 


1681 


A 


3043 


3 


1509 


AGIRHEAPPTTSNRHRRQIDRGVTHLNISGLK 

MPRGIAIDWVAGNVYWTDSGRDVIEVAQMK 

GENRKTLISGMIDEPHATWDPLRGTMYWSD 

WGNHPKIETAAMDGTLRETLVQDN1QWPTG 

LAVDYHNERLYWADAKLSVIGSIRLNGTDPI 

VAADSKRGLSHPFSIDVFEDYIYGVTYINNRV 

FKJHKFGHSPLVNLTGGLSHASDVVLYHQHK 

QPEVTNPCDRKKCEWLCLLSPSGPVCTCPNG 

KRLDNGTCVPVPSPTPrPDAPRPGTCNLQCFN 

GGSCFLNARRQPKCRCQPRYTGDKCELDQC 
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SEQ ID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
KO: of 
peptide 
seq- 
uence 


Met 
bod 


SEQ 
ID NO: 
n 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
ocation 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A~Alanine OCysteme, 
}=Aspartic Acid. E=Glutamic Acid, 
^Phenylalanine G=Glycine, H=Histidine, 
I^lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V-Valine, W-Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide aeietion. vpubbiuic 
nucleotide insertion 














WEHCRNGGTCAASPSGMPTCRCPTGFTGPKC 

TQQVCAGYCANNSTCTVNQGNQPQCRCLPG 

FLGDRCQYRQCSGYCENFGTCQMAADGSRQ 

CRCTAYFEGSRCEVNKCSRCLEGACVVNKQS 

GDVTCNCTDGRVAPSCLTCVGHCSNGGSCT 

MNSKMMPECQCPPHMTGPRCEEHVFSQQQP 

GHIASILIP 


332 


1682 


A 


3045 


3 


952 


TTTISNFIITQVNRTYCCGTYRAGPMRQISLVG 
AVDEEVGDYFPEFLDMLEESPFLKMTLPWGT 

MPKSPFKRRRSMNEIKNLQYLPRTSEPREVLF 
EDRTRAHADHVGQGFDWQSTAAVGVLKAV 
QFGEWSDQPR1TKDV1CFHAEDFTDVVQRLQ 
t t>t ji-nT\n\rcr\r , \rr\w/\/TiV Ak'I TsJOMRRFCilRY 
ARlQLCDNDrYFIPRNVIHQFKTVSAVCSLAW 
HIRLKQYHPWEATQNTESNSNMDCGLTGKR 

PASE 


333 


1683 


A 


3046 


497 


167 


SACSTGPELPGRATRSLTRPANQKGCDGDRL 
YYDGCAMIAMNGSVFAQGSQFSLDDVEVLT 
ATLDLEDVRSYRAEISSRNLAVSAPVDTCVG 
CSSKTWKVAPFVRAWWRP 


334 


1684 


A 


3053 


37 


276 


r VlTDLEEQLNQLTEDNAELNNQNFYLSKQLD 
EASGANDEIVQLRSEVDHLRREITEREMQLTS 
QKQVKRVNKVVRSLEDF 


335 


1685 


A 


3054 


2 


846 


WD A WGD W SDCSRTCG G G AS YSLRRCLTGR 

NCEGQNIRYKTCSNHDCr PUAtur ka^^^a 

YNDVQYQGKYYEWLPRYNDPAAPCALKCH 

AQGQNLWELAPKVLDGTRCNTDSLDMCISG 

ICQAVGCDRQLGSN AKEDNCG VC AGDG STC 

RLVRGQSKSHVbrJbKKlifclN viAvrLuoao > ,V1 

TVKGPAHLFIESKTLQGSKGEHSFNSPGVFVV 

ENTTVEFQRGSERQTFKIPGPLMADFIFKTRY 

TAAKDSWQFFFYQP1SHQWRQTDFFPCTVT 

CGGG 


336 


1686 


A 


3058 


54 


347 


VVGKQEAGAHSDSCCLLHTPPRLTPAHSRKA 
LRNSRIVSQKDDVHVCIMCLRAIMNYQVSRG 
AWDWRLGSPACPHWGLHKLPRLWDPLSLYP 
VLCWGT 


337 


1687 


A 


3059 


2 


709 


" ILTSLVELTRFETLTPRFSATVPPCWVbVQQt 
QQQRRHPQHLHQQHHGDAAQHTRTWKLQT 
DSNSWDEIIVFELVLPKACMVGHVDFKFVLN 

LEDHKEDILCGPVWLASGLDLSGHAGMLTLT 
ct>t^t \/irn*4 Afim^VRSFT 1H VKL AVNERGTEE1 
CNGGMRPVVRLPSLKHQSNKGYSLASLLAK 
VAAGKEKSSNVKNENTSGTRK 


338 


1688 


A 


3060 


85 


384 


" ' IGAFYNYHVLELLQMLVTGGVSSQLEQHLDK 
DKVYGVADSCTSLLSGRNRCKLGLLSLHETIL 
SDVNPRNTFGQLFCGSLDLFG1LCVGLYRI1DE 
EELNP 


339 


1689 k 


A 


3063 


236 


362 


" CFLCLSGDFMVMTIFFNVSRRFGYVAFQNYV 
PSSVTTMLSWV 


340 


1690 


A 


3065 


3 


1249 


DLWQFTPLHEAASKNRVEVCSLLLSYGADPT 
LLNCHNKSA1DLAPTPQJLKERLAYEFKGHSLL 
QA^UIEADVTRIKKHLSLEMVNFKHPQTHETA 
LHCAAASPYPKRKQICELLLRKGANINEKTKE 
FLTPLHVASEKAHNDVVEVWKHEAKVNAL 
DNLGQTSLHRAAYCGHLOTCRLLLSYGCDPN 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E-Glutamic Acid, 
F=Phenyl alanine, G=Glycine, H=Histidine } 
I^Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valinc, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *-Stop codon. 
/^possible nucleotide deletion, Y=possible 
nucleotide insertion 














IISLQGFTALQMGNENVQQLLQEGISLGNSEA 
DRQLLEAAKAGDVETVKJCLCTVQSVNCRDIE 
GRQSTPLHFAAGYNRVSVVEYLLQHGADVH 
AKDKGGLVPLHNACSYGHYEVAELLVKHGA 
V\nsTVADLWKFTPLHEAAAKGKYEICKLLLQ 
HuAUr 1 KKInRJJCjN J PLDL VKDGDTDJQDLLR 
GD AALLD AAKKG CL AR VKKLS SPDN VNCRD 
TQGRIISTPLHLAGK 


341 


1691 


A 


3070 


1 


547 


G\O.IPSFQNQLFADILAGIESVTSEHNYQTLIA 
NYNYDRDSEEESVINLLSYNIDGIILSEKYHTI 
R 1 VKFLRS A 1 1PV VELMDVQGERLDMEVGFD 
NRQAAFDMVCTMLEKRVRHKILYLGSKDDT 
RDEQRYQGYCDAMMLHNLSPLRMNPRAISSI 
nl^KMQLMRJLJALb AN rDLDu VFC TN 


342 


1692 


A 


3073 


463 


3 


RTNRCRKJPSDADTLVPGDTISLIGTTSLRIDYNE 
IDDNRVTAEEVDILLREGEKLAPVMAXTRILR 
AYSGVRPLVASDDDPSGRNVSRGIVLLDHAE 
RDGLDGFITITGGKLMTYRLIVLAEWATDAVC 
RKLGNTRPCTTADL ALPG SQEPAK VP 


343 


1693 


A 


3075 


250 


1 


LLIYLAIFAPVAMSALAGVKSVQQVRIRAAQS 
LGASRAQVL WFVILPGALPEILTGLRIGLGVG 
WSTLVAAELIAATRGLGFM 


344 


1694 


A 


3076 


2 


138 


LYFDAYLQSLQVAAISTFCCLL1GYPLAWAV 
AHSKPSTRNILLLL 


345 


1695 


A 


3078 


469 


3 


LKIRGQRIELGEIDRVMQALPDVEQAVTHAC 

VINQAAATGGDARQLVGYLVSQSGLPLDTSA 

LQAQLRETLPPHMVPVVLLQLPQLPL1ANGKL 

DRJKALPLPELBLAQAPGRAPKAGSETIIAAAFS 

SLLGCDVQDADADFFALGGIISLLAMRLAT 


J 4(3 




A 


3082 


404 


2 


QNITSKDLDVRLDPQTVPIELEQLVLSFNHM1 

ERIEDVFTRQSNFSADIAHEIRTP1TNLTTQTEI 

ALSQSRSQKJELEDVLYSNLEELTRMAKMVSD 

MLFLAQADNNQL1PEKKMLNLAHEVGKVFD 

QFEALPE 


347 


1697 


A 


3084 


3 


340 


NELTFKEAEISKLYTKVHPAYRTLLEKRQALE 
DEKAKLNGRVTAMPKTQQE1VRLTRDVESGQ 
QVYMQLLNKEQELKITEASTVGDVRIVDPAIT 
QPGVLKPKKGL11LGAI 


348 


1698 


A 


3086 


723 


10 


TQAMVWQQKACAEDDPQLSGRHWLHAATL 

YNIAAYPHLKGDDLAEQAQALSNRAYEEAA 

QRLPGrNlRQMEFTVPGGAPITGFLHMPKGDG 

PFPTVLMCGGLDAMQTDYYSLYERYFAPRGI 

AMLTIDMPSVGFSSKWKLTQDSSLLHQHVLK 

ALPNVPWVDHTRVAAFGFRFGANVAVRLAY 

LESPRLKAVACLGPVVHTLLSGLKCQQQVPE 

M YLUYLAoKJLUiNlHDAblKbb I RhNH 


349 


1699 


A 


3087 


2 


249 


RIRSSDPEITLAGTPLHAAYL1GMTLICAGFSV 
GFGVAMSQALGPFSLRAGVASSTLGIAQVCG 
SSLWIWLAAWGIGAWNM 


350 


1700 


A 


3099 


3 


424 


EAPEATPQPSQPGPSSPISLSAEEENAEGEVSR 

NQRGIQYIDLSSDSEDVVSPNCSNTVQEKTFN 
KDTVnVSEPSEDEESQGLPTMARRNDDlSELE 
DLSGMEDLK 


351 


1701 


A 


3108 


2 


404 


IKKNHIIGYQLLHRRALFEICRTRLSDYALIFG 

MFGIVVMVIETELSWGAYYKAPLYSLALKCL 

1SLFTIILLGLTIVYHARE1QLFMANYGADDWR 

SALTYFPIFLILLEALRGVIHATPCRVSLSLWD 

GLDLP 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO : 
in 

XJSSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alaninc C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine 5 K=Lysine, L^Leucine, 
M=Methionine 3 N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valinc, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


352 


1702 


A 


3110 


341 


2 


AQLAEVCPPQTLLTTNTSSIS1TA1AAEIKNPER 
VAGLHFFNPAPVMKLVEWSGLATAAEVVE 
QLCELTLSWGKQPVRCHSTPGFIVNRVARPY 
YSEAWRALEEQVAAPEVI 


353 


1703 


A 


3111 


3 


188 


HFSLFR1AFAVFLTYMTVGLPLPVIPLFVHHEL 
GYGNTMVG1AVGIQFLATVLTRGYAGRLA 


354 


1704 


A 


3116 


367 


225 


WQLFI ILNGTFLNIGETDTESC VNG W VYDRSS 
FPFSNMTEVRGLVFLS 


355 


1705 


A 


3117 


101 


53 


" VINL V Y1J SSPRPELKP VDKESEVVMKFPDGF 
EKFSPP1LQLDEVDFYYDPKHVIFSRLSVSADL 
ESRICVVGENGAGKSTN4LKLLLGDL\APVRGI 
RHAJHRNLKiGYFSQHHVGAAGT* IJrbACuXNL 
LGTQVFLGRPEEEYVRHQLGFGMGISGELGHA 
S SLP ACLG G QKEAEVAFCSDGLLPCPNFLML\ 
DEPTNXHLGHGRAIEALGPCLQTISGVGYILVS 
HE* SALSRLVCRE\L\WC*GRSTSPF 


356 


1706 


A 


3121 


137 


466 


RGGRDWGEHNQRLEEHQARAWQGAMDAG 
AASREHARWQGTGLAPGTRVAVAPTCVQGL 
PQERSVCRPFFSSRWREGPVWALGAGAHGKP 
RWSGGVRCVVRGGRWFTPAPFI 


357 


1707 


A 


3124 


1249 


229 


' MLEAPGPSDGCELSNPSASRVSCAGQMLEVQ 
PGLYFGGAAAVAEPDHLREAGITAVLTVDSE 
EPSFKAG PG VEDL WRLFVP ALDKPETDLLSH 
LDRCVAFIGQARAEGRAVLVHCHAGVSRSV 
A1ITAFLMKTDQLPFEKAYEKLQILKPEA1CMN 
EGFEWQLKLYQAMGYEVDTSSAIYKQYRLQ 
KVTEKYPELQNLPQELFAVDPTT V SQGLKDE 
VLYKCRKCRRSLFRSSSILDHREGSGPIAFAH 
KJRMTPSSMLTTGRQAQCTSYFIEPVQWMESA 
LLGVMDGQLLCPKCSAKLGSFNWYGEQCSC 
GRWITPAFQIHKNRVDEMKILPVLGSQTGKI 


358 


1708 


A 


3127 


816 


139 


EVETLGPRTPGP/EAQSPTPGSCPGWQEPSPGP 
TPPP*LSGPGPQGAPVEGKLLPDPEETPAGKTP 
LGKHFW WGLAP VTS ANFSPGAAA*r uOAi^SF^ 
GGDL/GHMLLQGPPSPFRLQQQ* QTPPGSHSP 
PTANREINPGPAAAADTRSCWGHKRSWRGW 
RGLAPWRLGFGSPGIP*PAPAGIP/GRPTWEGG 
KG AGGKPSETLTRSPP V WRukKub AJN UfLS w 
VQILQ 


359 


1709 


A 


3132 


3 


191 


HEHLLLLLLCVFLVKSQGVNDNEEGFFSARG 
HRPLDKKREDAPNLRP.ALADV1TVCDYRAQIA 
*AASTPKRAASIAHNAVSCR*AQIA 


360 


1710 


A 


3134 


1 


286 


" "REPPRPALLFF*DRVSLCCPGWNAVVQSQLT 
AAPTSQ VQ/SD SPTFPSS WDYRHVPEYPANFL 
* RQGFPMLPRLVSNS W AQT VHPPRPPKVLDL 
QA 


361 


1711 


A 


3135 


56 


1449 


" PVPAPRVSPSARGAPGRPRLPGVRGPRHS/WA 
AJD*RGSRM/PPRAPAPSPTGP/APGGKKVRGR 
VPEDPDAYEPRCSAL*V*PTHVTSPQFCDP*N 
GQIRSYTTVLLRGLNETMLVK/PLCRREP/PEA 
ripp.p n^TP A VTRDHROHEDPRGAGRQWDAD 
PRPSAP/PAEVATGSRPGRHMWMRLCLAAQQ 
APGLPHRTSIRPGWRRLTEPEAWARRHRRPW 
GQRGAVRPPPQGAAPPPSHQGRRTNTDPSA1 
PRLTVMSRCLAPDLKAPASGPRGWRRGMPQ 
SS/GALLWTPPPTPRGSHSPRPREAPLRAIHPA 
GPSK/SRAGASGRLPEVIYGWVTLFTPPEAGT 
F/LIPSPT*MSPALVIQPPVPPTQMGLRISGLPR 
QG*PSGAPW*LPGLAQLAFQCHLPHDEVGPP 
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in 
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nucleotide 
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ng to fust 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenyl alanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine s N=Asparagine 5 P=ProIme, 
Q=Glutamine, R=Arginine, S=Serine, 
T—Threonine, V=Valine 3 W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














RNQSPLGNDTLSSGLPMGPRRQVWPLARVG 
GHSSPREPQVLKJCPLWGQTDIAGVGSASLYP 
DNL 


362 


1712 


A 


3136 


1270 


274 


RVGMVLGTREVGDSTPPPSPPLYPFTGNEFVQ 

HNTWQLSRVYPSDLRTDSSNYNPQELWNAG 

CQM/V*GGSRDWEEGVEEQQVGNKFSSDGR 

VGECSRKLLG*EMLSVDITSRYRAPSTYLLNS 

LKEGLEGLHGESCSSFLLGPSVAMNMQTAGL 

EMDICDGHFRQNGGCGYVLKPDFLRDIQSSF 

HPEKPISPFKAQTLLNQVISVQQLPKVDKTKE 

GSIVDPLVKVQIFGVRLDTARQETNYVENNG 

FNPYWGQTLCFRVLGPDFPMLRFGKMDYDW 

KSRNDLLGKTPCPGTCMQQGYRHIHLLSKDG 

ISLRPASIFVYICIQEGLEGDES 


363 


1713 


C 


3139 


60 


248 


MFAGSYGKSMFSFSKKVLNCLPKWRYHFVIA 
P AMN ESPL APHLHQHLV FS VFQVLTILIG V* * 


364 


1714 


A 


3140 


57 


418 


S AFKTLQLP AFSL YFDLG SLKLLILRJHTSIVK 
NHKVESPRTMSPG*DPQSFLQIPQPRPPQLRV 
GLTSGLIQHFHSPSSCQFPLLRGPPFPRQPPLG1 
SGASLCPVLSPPR*PLQPSSL 


365 


1715 


A 


3145 


122 


413 


LLPYPSLFVFLRQCHFVTXRLECNGVVSAHCN 
LHLPGSSDSPASAS*VAGTTGVCHHTRLIF\VF 
LV*TGFHYVAQAGLELLTA*S\PPQLPKWGL 
QA 


366 


1716 


A 


3150 


247 


2 


VGEKLHDIRFGNDFDMTPKAQATKEKIDKLN 
FIBCIKKLCIEGYY/NREPQNGRICIFANYVS\DK 
GLMATIYEELLKJLSNKLIQ 


367 


1717 


A 


3152 


3 


2367 


QKLKQNQPKRAHVEDGGSRSKQGNEQSKKT 

PIEKSDFAAATHPRAFYLSKPDETPNAWMSD 

SGTGLTYWKLEEKDMHHSLPETLEKTFISLSS 

TDVSPNQVLTLDPTLHMKPKQQ1SG1QPHGLP 

NAJLDDRISFSPDSVLEPSMSSPSDIDSFSQASN 

VTSQLPGFPKYPSHTKASPVDSWKNQTFQNE 

SRTSSTFPSVYTITSNDISVNTVDEENTVMVAS 

ASVSQSQLPGTANSVPEC1SLTSLEDPVILSKIR 

QNLfCEKFL^RHIADLRAYYESEINSLKQKLEA 

KEISGVEDWKJTNQILVDRCGQLDSALHEATS 

RVRTLENKNNLLEIEVNDLRERFSAASSASKI 

LQERIEEMRTSSKEKDNT1IRLKSRLQDLEEAF 

ENAYKLSDDKEAQLKQENKMFQDLLGEYES 

LGKEHRRVKDALNTTENKLLDAYTQISDLKR 

MISKLEAQVKQVEHENMLSLRHNSRIHVRPS 

RANTLATSDVSRRKWLIPGAEYSIFTGQPLDT 

QDSNVDNQLEETCSLGHRSPLEKDSSP/GSSST 

SLLIKKQRETSDTPIMRALKELDEGKIFKNWG 

TQTEKEDTSNSLL*/INPRQTETSVNASRSPEK 

CAQQRQKRLNSASQRSSSLPPSNRKSSTPTKR 

EIMLTPVTVAYSPKRSPKJENLSPGFSHLLSKN 

ESSPIREBCTYSEKATDNHVNHSSCPEPVPNGV 

KKVSVRTAWEKNKSVSYEQCKPVSVTPQGN 

DrbY 1 AK1K1 L.Abl bRrrDbL I KJiKDQLbAAL 

SRMPSPGGIUTLQTRLNQVKCLSLNLL 


368 


1718 


A 


3163 


2 


2350 


EFKSGGCGAGLVAAGAVLVLYPASRAGERT 

RVPGSPAPSSLPLHSPGACGTEVDMDPQRSPL 

LEVKGNIELKRPL1KAPSQLPLSGSRLKRRPDQ 

MEDGLEPEKKRTRGLGATTKITTSHPRVPSLT 

TVPQTQGQTTAQKVSKKTGPRCSTAIATGLK 

NQKP VP A VP VQKSGTSG VPPMAG GKKPSKRP 

AWDLKGQLCDLNAELKRCRERTQfLDQENQ 
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QpO ID £ 
NO: of r 
nucl- i 
eotide s 
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vlct £ 
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i 
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DNO: I 
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JSSN 1 
39/496 < 
?14 i 


Predicted I 
rcginning i 
lucleotide 1 
ocation < 
correspond i 1 
fig to first 
amino acid 
residue of 
peptide 
sequence 


>redicted end i 
nucleotide \ 
ocation 
iorresponding 
o last amino 1 
acid residue 
Df peptide 
sequence 


\mino acid sequence (A=Alanine C=Cysteme, 
>=Aspartic Acid, E=Glutamic Acid, 
^Phenylalanine, G^Glycine, H=Histidine, 
=lsoleucine, K=Lysine, L=Leucine, 
Vl=Methionine, N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
r=Threonine, V=Valine, W=Tryptophan, - 
Y=Tyrosine, X=Unknown, *=Stop eodon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion _ 














QLQDQLRD AQQQVKALGTERTTLECiHL AR v 

QAQAEQGQQELKNLRACVLELEERLSTQEGL 

VQELQKKQVELQEERRGLMSQLEEKERRLQT 

SEAALSSSQAEV ASLRQETVAQAALLTEREER 

LHGLEMERRRLHNQLQELKGNIRVFCRVRPV 

LPGEPTPPPGLLLFPSGPGGPSDPPTRLSLSRSD 

ERRGTLSGAPAPPTRHDFSFDRVFPPGSGQDE 

VFEEIAMLVQSALDGYPVCIFAYGQTGSGKTF 

TMEGGPGGDPQLEGLIPRALRHLFSVAQELSG 

QGWTYSFVASYVEIYNETVRDLLA 1 u I KJvu^ 

GGECEIRRAGPGSEELTVTNARYVPVSCEKEV 

DALLHLARQNRAVARTAQNERSSRSHSVFQL 

QISGEHSSRGLQCGAPLSLVDLAGSERLDPGL 

ALGPGERERLRETQAIN SSLb 1 LOL v imal^h 

KESHVPYRNSKLTYLLQNSLGGSAKMLMFV 

NISPLEENVSESLNSLRFASKVEPSVLFGTAQS 

NRKWKTDPDLCVCVCVCVCVCVCVCVCVP 

MSMYRVRGGRV AGOCrlOVVKAr'Ci kaiiv 


369 


1719 


A 


3165 


365 


12 


G YTSQGRWID1ERGPLT ANTESLI IENNFN ALP 
GYIRKIE*1*IYKKN*INFGGVGLLNIVKIS1LS/K 
IYRFDAIPVKILTRFFrNLDKLlLKr VLK I kipjs. 
NRIKTFY1MRRKKLGDSS 


370 


1720 


A. 


3170 


393 


42 


GASISPSAVIDGVEGLKPMQEQEAQEAGPCLD 
*HMAPEQWVAPR\RLLFRLIFSVLHALIIAAAA 
QSSAEEDEDPRN*GQSSEDQAPNl^NuL,i v i v n 
RVHVPLGAAATVPVHRSHFPR 


371 


1721 


A 


3173 


770 


510 


GNGGCGLSQlPPSHLGAFSRGSLLSRUXDPKUr 

PPHPVIFFVFVVE\QGFTVLARMVS1S*PCDPP 

ALASOSAGITGVSHLARPQNLYF 


372 


1722 


A 


3180 


3S1 


76 


RVLMllDNVPAHSSPQICREISQEFQLEIRHLP^ 
PDLAPSGCFLFLNLKNIFKXGTHFSLVDNVKK 
TVSTWLH/SQNAQFYKDRLNGWYHCLQKCL 
QHY*AYVEK 


373 


1723 


A 


3181 


410 


14101 


' RREVAGPEGKGLLLASAHTMLTPPLLJL^r'i.i. 
S AL V AAADD APKTCSPKQF ACRDQ1TCISKGW 
RCDGERDCPDGSDEAPEICPQSKAQRCQPNE 
HNCLGTELCVPMSRLCNG VQDCMDG SDEGP 
HCRELQGNCSRLGCQHHCVPTLDGPTCYCNS 
SFQLQ ADGKTCKDFDECSV YGTCSQLCTNTD 
GSFICG C VEG YLLQPDNRSCKAKNEP VDRPP 
VLLIANSQNILATYLSGAQVSTITPTSTRQTTA 
MDFSYANETVCWVHVGDSAAQTQLKCARM 
PGLKGFVDEHTINISLSLHHVEQMAIDWLTGN 
FYFVDDIDDRIFVCNRNGDTCVTLLDLELYNP 
KG1ALDPAMGKVFFTDYGQIPKVERCDMDG 
ONRTKLVDSKIVFPHGITLDLVSRLVYWADA 
YLDYIEWDYEGKGRQTI1QGILIEIILYGLTVF 
ENYLYATNSDNANAQQKTSVIRVNRFNSTEY 
QVVTRVDKGGALHIYHQRRQPRVRSHACEN 
DQYGKPGGCSD1CLLANSHKARTCRCRSGFS 
LGSDGKSCKKPEHELFLVYGKGRPGIIRGMD 
MGAKVPDEHMIPIENLMNPRALDFHAETGFI 
YF ADTTS YLIGRQKIDGTERETILKDGIHNVE 
GVAVDWMGDNLYWTDDGPKKT1SVARLEK 
AAQTRKTLIEGKMTHPRA1 V VDPl >N GWMYW 
TDWEEDPKDSRRGRLERAWMDGSHRDIFVT 
SKTVLWPNGLSLDIPAGRLYWVDAFYDRIETI 
LLNGTDRKIVYEGPELNHAFGLCHHGNYLFW 
TEYRSGSVYRLERGVGGAPPTVTLLRSEVRPPI 
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SEQ ID 
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seq- 
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SEQ ID 
NO: of 
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seq- 
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nucleotide 
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ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspariic Acid, E=GIutamic Acid, 
F=Phenyla!anine, G=Glycine, h=H\stidine, 
I=Isoleucine, K= Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline s 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y^Tyiosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possibie 
nucleotide insertion 














FEIRVMYDAQHQQVGSNKCRVNNAGCSSLCL 

ATPGSRQCACAEDQ VLDADG VTCL ANPS V VP 

PPQCQPGEFACANSRCIQERWKCDGDNDCLD 

NSDEAPALCHQHTCPSDRFKCENNRCIPNRW 

LCDGDNDCGNSEDESNATCSARTCPPNQFSC 

ASGRC1PISWTCDLDDDCGDRSDESASCAYPT 

CFPLTQFTCNNGRCININWRCDNDNDCGDNS 

DEAGCSHSCSSTQFKCNSGRCIPEHWTCDGD 

NDCGDYSDETHANCTNQATRPPGGCHTDEF 

QCRLDGLCIPLRWRCDGDTDCMDSSDEKSCE 

GVTHVCDPSVKFGCKDSARC1SKAWVCDGD 

NDCEDNSDEENCESLACRPPSHPCANNTSVC 

LPPDKJLCDGNDDCGDGSDEGELCDQCSLNN 

GGCSHNCSVAPGEGIVCSCPLGMELGPDNHT 

CQIQSYCAKHLKCSQKCDQNKFSVKCSCYEG 

WVLEPDGESCRSLDPFKPFIIFSNRHEIRRJDLH 

KGDYSVLVPGLRNTI ALDFHLSQSALY WTD V 

VEDKIYRGKLLDNGALTSFEVVIQYGLATPEG 

LAVDWIAGNIYWVESNLDQIEVAKLDGTLRT 

TLLAGDIEIIPRAIALDPRDGILFWTDWDASLP 

RIEAASMSGAGRRTVHRETGSGGWPNGLTV 

DYLEKJULWIDARSDAIYSARYDGSGHMEVL 

RGHEFLSHPFAVTLYGGEVYWTDWRTNTLA 

KANKWTGHNVTVVQRTNTQPFDLQVYHPSR 

QPMAPNPCEANGGQGPCSHLCLINYNRTVSC 

ACPHLMKLHKDNTTCYEFKKFLLYARQMEIR 

GVDLDAPYYNYUSFTVPDIDNVTVLDYDARE 

QRVYWSDVRTQAIKRAFINGTGVETVVSADL 

PNAHGLAVDWVSRNLFWTSYDTNKKQINVA 

RLDGSFKN A WQGLBQPHGL VVHPL RGKL Y 

WTDGDNISMANMDGSNRTLLFSGQKGPVGL 

AIDFPESKLYWISSGNHTINRCNLDGSGLEVID 

AMRSQLGKATALAIMGDKLWWADQVSEKM 

GTCSKADGSGSWLRNSTTLVMHMKVYDES1 

QLDHKGTNPCSVNNGDCSQLCLPTSETTRSC 

MCTAGYSLRSGQQACEGVGSFLLYSVHEG1R 

GIPLDPNDKSDALVPVSGTSLAVGIDFHAEND 

TIYWVDMGLSTISRAKRDQTWREDVVTNGIG 

RVEGIAVDW1AGNIYWTDQGFDVIEVARLNG 

SFRYVVISQGLDKPRAITVHPEKGYLFWTEW 

GQYPRJERSRLDGTERVVT ,VN VSIS WPNGISV 

D YQDGKL Y WCD ARTDKIERIDLETGEN RE VV 

LS SNNN4DMFS VSVFEDFIYWSDRTHANGSIK 

RGSKDNATDSVPLRTGIGVQLKD1KVFNRDR 

QKGTNVCAVANGGCQQLCLYRGRGQRACA 

CAHGMLAEDGASCREYAGYLLYSERTILKSI 

HLSDERNLNAPVQPFEDPEHMKNVIAJLAFDY 

RAGTSPGTPNRJFFSDIHFGMIQQINDDGSRRIT 

IVENVGSVEGLAYHRGWDTLYWTSYTTSTJT 

RI ITVDQTRPGAFERETVITMSGDDI-IPRAFVL 

DECQNLMFWTNWNEQHPSIMRAALSGANVL 

I LliiKJUlKl rNOLAlUHiOVEKLYr SDATLDKIE 

RCEYDGSHRYVILKSEPVHPFGLAVYGEH1F 

WTDWVRRAVQRANICHVGSNMICLLRVDIPQ 

QPMGIIAVANDTNSCELSPCRINNGGCQDLCL 

LTHQGHVNCSCRGGRILQDDLTCRAVNSSCR 

AQDEFECANGECINFSLTCDGVPHCKDKSDE 

KPSYCNSRRCKKTFRQCSNGRCVSNMLWCN 

GADDCGDGSDEIPCNKTACGVGEFRCIOGTC 

1GNSSRCNQF VDCEDASDEMNCS ATDC S S YF 
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| NO: of 
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ID NO: 

in 
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Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G^Glycine, H=Histidine, 
I=Isoleucine, K=Lysine : L=Leucine, 
M=Methionine, N=Asparagine, P=Proline 3 
Q=Gtutamine, R=Arginine, S=Serine, 
T=Threonine ; V=Valine, WKTryptophan, 
Y=Tyrosine, X=Unkno\vn, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














RLGVKGVLFQPCERTSLCYAPSWVCDGAND 
CGDYSDERDCPGVKRPRCPLNYFACPSGRCIP 
MSWTCDKEDDCEHGEDETHCNKFCSEAQFE 
CQNHRC1SKQWLCDGSDDCGDGSDEAAHCE 
GKTCGPSSFSCPGTHVCVPERWLCDGDKDCA 
DGADESIAAGCLYNSTCDDREFMCQNRQC1P 
KHF VCDHDRDC ADG SDESPECEYPTCGPSEF 
RCANGRCLSSRQWECDGENDCHDQSDEAPK 
NPHCTSPEHKCNASSQFLCSSGRCVAEALLCN 
GQDDCGDSSDERGCHTNECLSRKLSGCSQDC 
EDLK1GFKCRCRPGFRLKDDGRTCADVDRCS 
TTFPCSQRCINTHGSYKCLCVEG YAPRGGDP 
HSCKAVTDEEPFLIFANRYYLRKLNLDGSNY 
TLLKQGLNNAVALDFDYREQMIYWTDVTTQ 
GSMIRRMHLNG SNVQVLHRTGLSNPDGLAV 
DWVGGNLYWCDKGRDTIEVSKLNGAYRTVL 
VSSGLREPRALVVDVQNGYLYWTDWGDHSL 
1GRIGMDG SSRSVIVDTKITWPNGLTLDYVTE 
RIYWADAREDYIEFASLDGSNRHVVLSQDIPH 
IFALTLFEDYVYWTDWETKSINRAHKTTGTN 
KTLLISTLIIRPMDLHVFHALRQPDVPNHPCK 
VNNGCCSNLCLLSPGGGHKCACPTNFYLGSD 
GRTCVSNCTASQFVCKNDKCIPFWWKCDTE 
DDCGDHSDEPPDCPEFKCRPGQFQCSTGICTN 
PAF1CDGDNDCQDNSDEANCDIHVCLPSQFK 
CTNTNRCIPGIFRCNGQDNCGDGEDERDCPE 
VTCAPNQFQCSITKRCIPRVWVCDRDNDCVD 
GSDEPANCTQMTCGVDEFRCKDSGRCIPARW 
KCDGEDDCGDGSDEPKXECDERTCEPYQFRC 
KNNRCVPGRWQCDYDNDCGDNSDEESCTPR 
PCSESEFSCANGRCIAGRWRCDGDHDCADGS 
DEKDCTPRCDMDQFQCKSGHCIPLRWRCDA 
DADCMDGSDEEACGTGVRTCPLDEFQCNNT 
LCKPLAWKCDGEDDCGDNSDENPEECARFV 
CPPNRPFRCKNDRVCLWIGRQCDGTDNCGD 
GTDEEDCEPPTAHTTHCKDKKEFLCRNQRCL 
SSSLRCNMFDDCGDGSDEEDCSIDPKLTSCAT 
NASICGDEARCVRTEKAAYCACRSGFHTVPG 
QPGCQDINECLRFGTCSQLCNNTKGGHLCSC 
ARNFMKTI INTCKAEG SEYQVLYIADDNEIRS 
LFPGHPHSAYEQAFQGDESVRJDAMDVHVKA 
GRVYWTNWHTGTISYRSLPPAAPPTTSNRHR 
RQIDRGVTHLNISGLKMPRGIAIDWVAGNVY 
WTD SGRD VIEV AQMKGENRKTLIS GMIDEPH 
AJVVDPLRGTMYWSDWGNHPKIETAAMDGT 
LRETLVQDNIQWPTGLAVDYHNERLYWADA 
KLSVIGSIRLNGTDPIVAADSKRGLSHPFSIDV 
FEDYIYGVTYINNRVFKIHKFGHSPLVNLTGG 
LSHASDWLYHQHKQPEVTNPCDRKKCEWL 
CLLSPSGPVCTCPNGKRLDNGTCVPVPSPTPP 
PDAPRPGTCNLQCFNGGSCFLNARRQPKCRC 
QPRYTGDKCELDQCWEHCRNGO I CAAbrau 
MPTCRCPTGFTGPKCTQQVCAGYCANNSTCT 
VNQGNQPQCRCLPGF1 XiDRCQYRQCSGYCE 
NFGTCQMAADGSRQCRCTAYFEGSRCEVNK 
CSRCLEGACWNKQSGDVTCNCTDGRVAPS 
CLTCVGHCSNGGSCTMNSKMMPECQCPPHM 
TGPRCEEHVFSQQQPGHIASILIPLLLLLLLVL 
VAGVVFWYKRRVQGAKGFQHQRMTNGAM 
1 NVEIGNPTYKMYEGGEPDDVGGLLDADFAL 
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to last amino 
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of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteinc, 
D=Aspartic Acid, E=Glutamic Acid, 
F^Phenyl alanine, G-Glycine, H-=Hislidine 9 
r=IsoIeucine, K^Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine. V=Valine, W-Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop cod on, 
/^possible nucleotide deletion, ^possible 
nucleotide insertion 














DPDKPTNFTNPVYATLYMGGHGSRHSLASTD 
EKRELLGRGPEDEIGDPLA 


374 


1724 


A 


3187 


191 


1815 


CLELASAGKIPEESKALSLLAPAPTMTSLMPG 

AGLLPIPTPNPLTTLGVSLSSLGAIPAAALDPNI 

ATLGEIPQPPLMGNVDPSKIDEIRRTVYVGNL 

N SQTTTADQLLEFFKQ VGE VKF VRM AGDET 

QPTRFAFVEFADQNSVPRALAFNGVMFGDRP 

LKINHSNNAIVKPPEMTPQAAAKELEEVMKR 

VREAQSFISAAIEPGWLHSTSLCNDFLGCF*RR 

RMYRE*APCTICGTFHLCLIINWDL*LF*AYTA 

K*FFPPRVWKEQ*KKRR\RSRSHTRSKSRSSSK 

SHSRRKRSQSKHRSRSHNRSRSRQICDRRRSK 

SPHKKRSKSRERRKSRSRSHSRDKRKDTREKI 

KEKERVKEKDREKEREREKJEREKEKERGKN 

ICDRDICEREKDREICDKEKDREREREKEHEKD 

RDKEKJEKEQDKEKEREKDRSKEIDEKRKKDK 

KSRTPPRSYTvfASRRSRSSSRERRRRRSRSSSRS 

PRTSKTIKRKSSRSPSPRSRNKKDKJCI<£KERD 

HISERRERERSTSMRKSSNDRDGKJEKLEKNST 

S 


375 


1725 


A 


3192 


415 


101 


AHSSHQTRAILQEFQWDIIRHPPLASPNLALSG 
RFPNLKKSLRGTHFSS VKK\TTLT WLNSQDP 
WF/FFYP*SPDLQIPSSFRNGLNDWYHHSQKC | 
PDLDGAYVKJC 


376 


1726 


A 


3199 


931 


418 


GV*WCDLGSPQPPPPGFKQFCEGRSSSWDYR 
HVPPHPANFVFLLETGFLHAGQAGLAGDPPAS 
ASQSAGITGVSHTWPKNHLIFYACLVIRSKRI 

K 


377 


1727 


A 


3201 


274 


1285 


KTGYTSRGSPLSPQSSIDSELSTSELEDDSISM 

GYKLQDLTDVQIMARLQEESLRQDYASTSAS 

VSRHSSSVSLSSGKKGTCSDQEYDQYSLEDEE 

EFDHLPPPQPRLPRCSPFQRGIPHSQTFSSIREC 

RRSPSSQYFPSNNYQQQQYYSPQAQTPDQQP 

NRTNGDK/PPKKYA* PSPDAKYNCH* *QH\SSP 

VTVRNSQSFDSSLHGAGNGISRIQSdPSPGQL 

QHRVHSVGHFPVSIRQPLKATAYVSPTVQGSS 

NMPLSNGLQLYSNTGIPTPNKAAASGIMGRS 

ALPRPSL AING SNLPRSICI AQP VRSFLQPPKPL 

SSLSTLRDGNWRDGCY 


378 


1728 


A 


3202 


1 12 


1789 


VPGVTESRPSVLRGDHLFALLSSETHQEDPIT 

YKGFVHKWELDRVKLSFSMSLLSRFVGWG* 

PFKVNFY/TFNRQPLRV\QHRALELTGRWLLW 

PMLFPWAPRDVPLLPSDVKLKLYDRSLESNP 

EQLQAMRHIVTGTTRPAPYIIFGPPGTGKTVT 

LVEAIKQVVKHLPKAHILACAPSNSGADLLC 

QRLRVHLPSSIYRLLAPSRDIRMVPEDIKPCCN 

WDA1CKGEYVFPAKJCKLQEYRVL1TTLITAGR 

LVSAQFPIDHFTHIFIDEAGHCMEPESLVAIAG 

LMEVKETGDPGGQLVLAGDPRQLGPVLRSPL 

TQKHGLGYSLLERLLTYNSLYKKGPDGYDPQ 

FITKLLRNYRSHPTILDIPNQLYYEGELQACA 

DVVDRERFCRWAGVLPRQGFPIIFHGVMGKD 

EREGNSPSFFNPEEAATVTSYLKLLLAPSSKK 

GKARLSPRSVGVISPYRKQVEKIRYCITKLDR 

ELRGLDDIKDLKVTCCSTVTPCLPCAPTCPLP 

ETSSSFHSSPRPRPTPAALNRARALPEPLTPGD 

SNLRVWDGIRKPACLTNTSCHS 


379 


1729 


A 


3206 


432 


130 


PKAAPSVXLWFPPFL*GSFKPTKGHTXCVX1K 
* L STRE AXDSXPGRQI AXXRQG GK VETTTAL | 
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SEQID 
NO: of ] 
nucl- 
eotide 
seq- 
uence 


SEQ ID i 
MO: of 1 
peptide 
seq- 
uence 


Met 
fiod 


SEQ i 
[D NO: 
in 

USSN 
09/496 
914 


Predicted 

jeginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end I 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=A!anine OCysteine, 

pv a j Ar>\A — f^liil nin \c Acid 

/VSpartlC AC1U, Cj VjlUloUUi* rt-^na, , 

F=Phenytalanine, G=Glycine, H=Histidine, 
[ — isoieucine 3 is. — i-.ybiiic, *- A/Uu, " > ") 
{^Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine ? S^Serine, 
T=Threoninc, V=Valine, W=Tryptophan, 

V~HTWr»cin<* Y=T7nk'nnWTl *— StOD COdOtt, 

/=possible nucleotide deletion, \=possible 
nucleotide insertion 














XKQSNNKGTRASSYXEPOAXEQWKFPHKKL 
QLPGXTHE 


380 


1730 


A 


3207 


187 


507 


GGTGHPHPARPPLSGVGGCQCSHSKPWTAGS 
PEQRDHPAPHKQIEAGQGLPGPQAWGG*KGP 
AXLLPGPGGGPGPVASLEARAQASSGVTPNG 
GGRT YPYPTF S SGE 


381 


1731 


A 


3225 


1 


840 


rTDurui PAPQnriFPV/Hl *S1PSWGSF*GESL/ 

EMQLITSLGLQEFDIARNVLELIYAQTLVW1G1 

FFCPU-PF1QTV1IMT.FTMFYSKN1SLMMNFQPPS 

vAWPAQnMMTPFTFI 1 FFPSFTGVLC TLAITI 

WRLKPSADCGPFRGLPLFIHSIYSWIDTLSTRP 

GYLWWW I YRNL1G S VHFFFILTL1 VLIIT YL Y 

WQITEGRICMIRLLHEQIINEGKDKMFLIEKLI 

KLQDMEKKANPSSLVLERREVEQQGFLHLGE 

HDGSLDLRSRRSVQEGNPRA 


382 


1732 


A 


3238 


256 


38 


LLMIKVSSTCFSCHLHHHHHHHHRHHQGHN S 
LFFSLKSSSNSSTLPVYLSYNIILVFSKCLVFDF 

LFSNACL 


383 


1733 


A 


3241 


1542 


343 


KGAPSFVRLYQYPNFAGPHAALANKSFFKAD 
KVTMT.WNKKATAVLVIASTDVDKTGASYYG 
EQTLHYIATNGESAVVQLPK.NGPIYDVVWNS 
SSTEFCAVYGFMPAKAT1FNLKCDPVFDFGTG 

PRNAAY YbrHOHlL VJL,/\OrVJiNJ_,ii^i /ax-v uvirv 

VWNVKNYKLISKPVASDSTYFAWCPDGEHIL 
TATCAPRLRVNNGYKIWHYTGSILHKYDVPS 
NAELWQVSWQPFLDGTFPAKTITYQAVPSEVP 
NEEPKVATAYRPPALRNKPITNSKLHEEEPPQ 
NMKPQSGNDKPLSKTALKNQRKHEAKXAAK 
QEARSDKSPDLAPTPAPQSTPRNTVSQSISGDP 
EIDKKIKNLlsXKLKAIEQLKEQ AATG KQLEK 
xrrvr rrrnFPTAT T OFT POT FT GT 


384 


1734 


A 


3242 


3 


678 


TRSPAARSPGLETPTCLLFV1AAIAAVFVDSAIP 

RLIQHRPQDGSFPYT1LDPPLYLPGQCAPPQP 

LSQCARRVHGEKLRRPTFGPRHRGAGTAKMS 

ASLVRATVRAVSKRKLQPTRAALTLTPSAVN 

KIKQLLKDKPEHVGVKVGVRTRGCNGLSYTL 

EYTKTKGDSDEEVIQDGVRVFIEKJCAQLTLL 

GTEMDYVEDKXSSEFVFNNPNIKGTCGCGES 

FN1 


385 


1735 


A 


3243 


3190 


664 


" VAMGTPRAQHPPPPQLLFLILLSCPWigULFL 
KEEEILPEPG SETPT V ASEAL AELLHG ALLRR 
GPEMGYLPGPPLGPEGGEEETTTT1ITTTTVTT 
TVTSPVLCNNNISEGEGYYESPDLGSPVSRTL 
GLLDCTYSIHVYPGYGIEIQVQTLNLSQEEELL 
VLAGGGSPGLAPRLLANSSMLGEGQVLRSPT 
NRLLLHFQSPRVPRGGGFRIHYQAYLLSCGFP 
PRPAHGDVSVTDLHPGGTATFHCDSGYQLQG 
EETLICLNGTRPSWNGETPSCMASCGGTIHNA 
TLGRIVSPEPGGAVGPNLTCRWVIEAAEGRRL 
HLHFERVSLDEDMDRLMVRSGGSPLSPV1YDS 
DMDDVPERGL1SDAQSLYVELLSETPANPLLL 
SLRFEAFEEDRCFAPFLAHGNVTTTDPEYRPG 
ALATFSCLPGYALEPPGPPNAIECVDPTEPHW 
NDTEPACKAMCGGELSEPAGWLSPDWPQS 
YSPGQDCVWGVHVQEEKRILLQVE1LNVREG 
DMLTLFDGDGPSARVLAQLRGPQPRRRLLSS 
GPDLTLQFQAPPGPPNPGLGQGFVLHFKEVPR 
NDTCPELPPPEWGWRTASHGDLIRGTVLTYQ 
CEPGYELLGSDILTCQWDLSWSAAPPACQKI 
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SEQID 
NO: or 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
riU. or 
peptide 
seq- 
uence 


Met 
nod 


SEQ 
ID Nu. 

in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteinc, 
D=Aspartic Acid, E=Glutamic Acid, 
^Phenylalanine, G=Glycine 5 H=Histidine, 
I=lsoleucine, K==Lysine, L=Leucine, 
M=Meth ionine, N=Asparagi ne, P=Pro line, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














MTCADPGEIANGHRTASDAGFPVGSHVQYRC 

LPGYSLEGAAMLTCYSRDTGTPKWSDRVPKC 

ALKYEPCLNPGVPENGYQTLYKHHYQAGESL 

RFFCYEGFEUGEVTITCVPGHPSQWTSQPPLC 

KVTQTTDPSRQLEGGNLALAILLPLGLVIVLG 

SGVYIYYTKLQGKSLFGFSGSHSYSPITVESDF 

SNPLYEAGDTREYEVSI 


386 


1736 


A 


3250 


5725 


3984 


GTSTVTMATKXHFSIILNLLG^LKKDNQDT 

RKLLMTWALEVAWMKKSETYAPLFCLPSF 

HKFCKGLLADTLVEDVNICLQACSSLHALSSS 

LPDDLLQRCVDVCRVQLVHRGTCIRQAFGKL 

LKSIPLGVFLSNNNHTEIQEISLALRSHMSKAP 

SNTFHPQDFSD/VI SFIL YGNSHRTGKDN WLE 

RLFYSCQRLDKRDQSTIPRNLLKTDAVLWQW 

AIWEAAQFTVLSKLRTPLGRAQDTFQTIEGIIR 

SLAGHTLNPDQDVSQWTTADNDEGHGNNQL 

RLVLLLQYLENLEICLMYNAYEGCANALTSPP 

K VIRTFL YTNRQTCQD WLTRIRLSIMR VG LL A 

GQPAVTVRHGFDLLTEMKTTSLSQGNELEVSl 

MMVVEALCELHCPEAIQGIAVWSSSIVGKHL 

LWINSVAQQAEGRFEKASVEYQEHLCAMTG 

VDCCI S SFDKS VLTL AS AG CKS A S LKHCLN GE 

SRKSVLSKPTDSSPEV1NYLGNKACECYISTA 

DWAAVQEWQNAIHDLKKSTSSTSLNLKADF 

N YIKbbbSr hbCrKr VbC IbQLbLLPGENINLLA 

GGSKJEKJDMKKLLRNM 


387 


1737 


A 


3255 


380 


76 


MDIFLYNCKYQVQTEI*NSIQHIMA\SKKLSRF 
LKWHNL*AENYKTL1VIK*INEDLNKQRDVPY 
b ¥ 1 AKbNKJvlMF 1 K i lrI<>KA.lYIKiPArYFIET 
NMQ 


388 


1738 


A 


3260 


685 


428 


PQWLG LQ V YALPPAJNF VFF VEMRSTILAQTG 
FELLDSSDLPASASKSAGUCMSHHARTLSLK 
* WrrCLkA / QhKrC^rA^rXrVAW 


389 


1739 


A 


3269 


1 


332 


LDG YHTPI YMLNRIIRLPAAL* IISDQTGHALTI 
LTRLETQMINADYQNKLTLDYLLTTDREVYE 
PFNLTNYCLIIIHNQRLG AYDLG * V* Q/KL AH V 
PVQV*HGFDPEAMFR 


ion 




A 


j! /U 


2 


372 


GRCmjQNKGKSUXjPDAQAEACGGESTYQEL 
LVNQNPIGQPLACRI^LTRKIYEGIKKAVKPNH 
SPRG VKK VHKF VNKGEKGIM VLAGDTLG IG V 
YCLLPCMC*DRKLTYAHIPSTTDLGAGAGY 


jy i 


1 741 1 
l / *t l 


A 
/A 




-i 
I 


1 0*7 


rrQbJVlJJUlMrx^ 1 PVLKEDAPRQ 
HVETFFQXEELTRSQEGMKl ,GENFLMFAMPP 
DDSICESKGK*FFQEMLDIMKAISDMMGKCTY 
PVLKEDAPRQHVETFFQVGINQKSRGHEVRR 
KFPDVCHAPR 


J/- 


1 74? 

1 / *Ti. 


A 








rrr oL)u V ar CJKQAO V * WriDLDbLQNLPPGFK 
RFSYLSLPSSW\DYRHVLPRQANFC1F/M*RRG 
FTMLARMVSIS*PRDLPALASQSAGITGVSHH 
APPQMDFTFALLCFALKGCLPRQKEGGTLNLI 


393 


1743 


A 


3283 


385 


3 


RNR^VVPFFVI T CiT ^AnPOTOTT T FVI FVVTP 
rvj nxvo v v r LjI v 1jJ-»vj.l> o.rv \j«r i y i l_<L»r v l_,r v V lv_ 

LLTVMGNLLLLVVINADSCLHTPMYFFLGQL 
SFLDLCHSSVTAPKLLENLLSEKKTISVEGCM 
A* VFFVFATGGTESSLL A VMA YDRY V AIRTR 
G 


394 


1744 


A 


3284 


575 


1054 


CTKCKADCDTCFNKNFCTKCKSGFYLIILGKC 
LDNCPEGLEANNHTMECVSIVHCEVSEWNP 
WSPCTKKGKTCGFKRGTETRVREIIQHPSAKG 
NLCPPTNETRKCTVQRKKCQKGERGKKGRE 



174 



5DOCID: <WO 01571 88A2_I_> 



WO 01/57188 



PCT/U SO 1/03800 



SEQ ID I 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 1 
sIO: of 1 
-»eptide 
seq- 
jence 


Viet i 
lutl 


SEQ 1 1 
T>NO: 1 
n i 
USSN 
09/496 
914 


Predicted 1 
beginning i 
mcleotide 
ocation 

correspond i 1 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end i 
nucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


\mino acid sequence (A=Alamne OCysteine, 
[)=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
!=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serinc, 
T=Threonine, V^Valine, W-Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














RKRKKPNKGESKEAlPDSKSLESSKEIPEQKtN 
KQQQ 


395 


1745 


A 


3286 


1 


340 


RVLYVPSMGFCILVAHGWQKISTKSVFKKI.S 
WICLSMVILTHSLKTFHKNWDWESEYTLFMS 

. t »rin.n/\T\T i l/l \"l /XTKT\ If "LI A T TTkTPl<f "KTFPT5 AT 

ALK VN KNN AKL w N N V (jfrlALniN £.kin r cj^aj^ 
KYFLQATHVQPDDIGAHMNVGR 


396 


1746 


A 


3293 


1 


172 


GFRAVVMTVKTEAAKGTLTYSRMRGMVA1L 
1AFMKORRMGLNDFIQK1ANNSYACKQ 


397 


1747 


A 


3295 


12 


401 


AEPACGASSCTPPSLRSSSSQSVGPLRFGRF1- 
W SEACAFL* AAAPQGPASPCCGLPSGFPRVW 
AQCCPPGGALRFPEGLGSVLSPRRCPQVSRGS 
GLSAVPQEVPSGFLGPGLRACPQEAPSRFLRA 

GLT 




1748 


A 


3300 


1912 


276S 


KQRRWQNIQRKGPKRYIV1AGNSQSHQPM1FS 

MLRKLPKVTCRDVLPEIRAICIEEIGCWMQS^ 

STSFLTDSYLKYIGWTLHDKHREVRVKCVKA 

LKGLYGNRDLTARLELFTGRFKDWMVSM1V 

DREYSVAVEAVRLLILILKNMEGVLMDVDCE 

SVYPIV*ASN*GLASAVGEFLYWKLFYPECE1 

RTMGGREQRQSPGAQRTFFQLLLSFFVESKSH 

SVTQAGVQWQFSAHRDLCLPGSSNSHVSASR 

VAG1AGAHRHTWLIYVFFSWRQGFAVLAGL 

VSNS 


399 


1749 


A 


3301 


536 


2391 


LRSYGCKAPSRISHLHKXFLFLLLPSLLMGYSE 

SPPPITDSWAPFISLTHHVLSQSQSPLSSNCWI 

CLSTHTQ*FTALPADLLTWTQSNVSLH1SYLAI 

PFLADSFLKPV/L*PGNSAKHLSFKLSSLSMVS 

GRAVALLHLIASGLTS1QTNTASSKPPIWGYVL 

STQTSFISPPPLCLSRTYPNPAHATMVGQVPQ 

SLCGLIFTL/RTPCRPSILHPNYKIISTSAWQKV 

LCFSGSPT1HTSLHLTTGSSFLSFHPIPGFPAAN 

SALYVSSLKGPPGKNVTIPSPVTGT*QPPHRGS 

N/RLTVDKDNFFLSPKPNSLHQLPSQ\TPYQAL 

TGAALAGSYPIWENENTLSWLPTFTYNFCLST 

PSLFFLCDTN*YLCLPANWSGTCTLVFQAPT1 

NILPPNQTILISVEASISSSPIRNKWALHLITLLT 

GLG1TAALGTG1AGITTSITSYQTLFTTLSNTVE 

DMHTSITSLQRQLOFLVGVILQNWRVLDLLT 

TEKGGTCIYLQEECCFCVNESGIVHIAVRRLH 

DRAAEL*HQVADS W WQOSbLLK w IF w v Art* 

LGPLIFLFLLLMIGPCIFNLVSRFISQRLNCFIQ 

ASMQKHIDN1FHLCHV*YQSLRGNHSEAPEPR 

P 


400 


1750 


A 


3303 


2 


453 


THWRHSSGVPGSlTARRRRRELElATSUNgb 

YYNRLCQEVTNRERNDQKMLADLDDLNRTK 

KYLEERLIELLRDKDALWQKSDALEFQQKLS 

AEERWLGDTEANHCLDCKREFSWMVKKiitiu 

R1CGRIFCYYCCNNYVLSKHGGKKERCC 


401 


1751 


A 

1 


3304 


1 


626 


"MAPQHSSLDDKVPQQASTVCFEFQDILQHSQ 
CTEHKDSLWGPGARSQPFGAHNTRLSPDSCP 
EKIVLRALKDSRAGMPEQDKDPGVQENPDD 
QRRVPQGTGDAPSAFRPLWDNGGLSPFVSRP 
GPLERDLHAQRSEVTYNQRSQSSWMSSFPKR 
NAFVSPYSSMGQAQP/GLPKTNP1GESCCWEG 
LSLSTQ1LG*QKPSKY1PSLCKR 


402 


1752 


A 


3305 


1678 


172 


MELPSGPGPERLFDSHRLPGDCFELLVLLLYA 
P VGFCLL VLRLFLGIHVFL V SC ALPD S VLRRF 
VVRTMCAVLGLVARQEDSGLRDHSVRVLISN 
HV f PFDHNIVNLLTTCSTVSESEAESATGRFP 
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SEQ ID 

JN<J. OI 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

lNU- OI 

peptide 
seq- 
uence 


Met 
nod 


SEQ 

ID N\J. 

in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A"=AJanine C=Cysteine, 
D-Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine_. H=Histidine, 
I=Isoleucine, K=Lysine, L~Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
QKjlutamine, R=Arginine a S=Serine, 
T=Thrconine. V= Valine, W=Tryptophan, 
Y=Tyrosine, X^Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possib!e 
nucleotide insertion 














GAQLKAPLSPLAFRMEDTEALPLTPILYPTCQ 

FFFFVIFLNIFLLAFSSPGSQPLLNSPPSFVCWSR 

GFMEMNGRGELVESLKRFCASTRLPPTPLLLF 

rh,tSbA 1 NUKbGLLRr SS WPFSIQDVVQPLTLQ 

VQRTLVSVTVSDASWVSELL\WSLFVPFTVY 

QVRWLRPVHRQLGEANEEFALRVQQ\LVAKE 

LG\QTGTRLTPA\DKAEHMKRQRHPR\LRPQS 

AQSSFPPSPWVLSS/SDVQTGQTLGFREFKESF 

L,r H V Al U V r IP h Kr W r K 1 G C C Kl L Tl HLILL * G 

GPVSFSCPE\DIHPRGT*VPTQQASGLPSFPSYG 

PARGG VL* HPSAQQPLTFAVKSSXW ARAGRAL 

QERKQ\AJLYEYARRRFTERRAPGGLD 


403 


1753 


A 


3307 


44 


447 


DPSPSLLAVALGLRAGERTRSGPGSSSPSGGIS 
uuAaAuLAborbtACOKoHr rCAVSALGECT 
CIPAQWQCDGDNDCGDHSDEDGCILPTCSPL 
DFHCDNGKCIRRSWVCDSDNDCEDDSDEQD 
CPPRECEED 


404 


1754 


A 


3311 


409 


1 


PRHGWGRRVLGRDRPRLQKVKKSVKAIYIPG 
QDHVQNEEIYARVLDKFGSNFLSRDNADLGT 
AFVKFSTLTK*LSALLKNLLQGLSRNVIFTLDS 
LLKGDLKGVKGDLKLKPFDKAWKDYETKFAK 
IEKEKREREWR 


405 


1755 


A 


3322 


12 


458 


AAVPVENPWDDPRVRPRVRIFTWEDCIAGQA 

KVLCn^SYGVTIDWSPKGAFIRLTSQSVGNG 

HPASKENDQMVDTIKNTTKVPIIWTYGDMVE 

PRPQMIRPAVGAKHKELWKJLMALKXDCMWE 

GKYTKPSQYNPNYK4LELAHNDSVW 


406 


1756 


A 


3324 


1 


426 


LSMLST1STEHRLSVLWP1WYCCHCPTHLSAV 

MCVLLWALSLLQSHEWMFCSFLFSDVDSDN 

WCQILDFLTAV\VLIFLI\LVLCGFTLVLLVRJIC 

GSQKMPLTRLYVTILLTGLVFLFCSLPLSIQ*F 

LLYWIEKDLDDL 


407 


1757 


A 


3328 


213 


1841 


SGDLSPAELMMLT1GDVIKQLIEAF1EQGKDID 

LNKVKTKTAAKYGLSAQPRLVDIIAAVPPQY 

RKVLMPIO,KAKPIRTASGIAVVAVMCKPHRC 

PHISFTGNICVYCPGGPDSDFEYSTQSYTGYEP 

TSMRAIRARYDPFLQTRHRffiQLKQLGHSVD 

KVEFTVMGGTFMALPEEYRDYFIRNLHDALS 

GHTSNNIYEAVKYSERSLTKCIGIT1ETRPDYC 

MKRHLSDMLTYGCTRLE1GVQSVYEDVARD 

TNRGHTVKAVCESFHLAKDSGFKVVAHMMP 

DLPNVGLERDIEQFTEFFENPAFRPDGLKLYP 

TLV1RGTGLYELWKSGRYKSYSPSDLVELVA 

RILAI.VPPWTRVYRVQRDIPMPLVSSGVEHG I 

NLREL ALARMKDJLGJ QCRD VRTRE VG1QEIH 

HK VRP YQ VEL VRRD YVANGG WETFLS YEDP 

UQD1L1GLLREKKCSEE I r RFELGGGVSIVREL 

HVYGSVVPVSSRJDPTJCFQHQGFGMLLN4EEA 

ERIAREEHGSGKIAVISGVGTRNYYRKIGYRL 

QGPYMVKMLK 


408 


1758 


A 


3335 


3 


467 


KG AKKKA V/DNIIN IGKTLVTRTQRTKIASDG 
LKGR VFEESL ADLQND\TDGYLLR VI* VAFIT 
ERTNQI/REVFNKLIPDSIGKDIEKACQSIYPLH 
DDFARKVKMEKKPKFET .RJCLMELHGEGSS 


409 


1759 


A 


3338 


7 


1252 


PRWRNSARDEILLSFPQNYYIQWLNGSLIHGL 
WNLASLFSNLCLFVLMPFAFFFLESEGFAGLK 
KGIRARILETLGMLLLL.^LLILGIVWVASALID 
NDAASMESLYDLWEFYLPYLYSCISLMGCLL 
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SEQ ID ! 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 3 
MO: of 
peptide 
seq- 
uence 


Met 
tiod 


SEQ J 
[DNO: 1 
in 

USSN 
09/496 
914 


Predicted I 

beginning 

[lucteotide 

ocation 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
lucleotide 
ocation 
corresponding 
lo last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine ; G=Glycine, H=Histidine, 
1-lsoleucine, K=Lysine, L=Leucine ? 
M=Methionine, N=Asparagine, P=Proline ; 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, \V=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














UJXTPVGUSRMFTVMGQLLVKPTILEDLDb 
QIYI1TLEEEALQRP 1 KWAVrlKW/KiiNiimJi*D 
QELENVKTLKTKLERRKKASAWERNLVYPA 
VMVLLLIETSISVLLVACN1LCLLVDETAMPK 
GTRGPGIGNASLSTFGFVGAALEIILIFYLMVS 
SVVGFYSLRFFGNFTPBCKDDTTMTKIIGNCVS 
ILVLSSALPVMSRTLGITRFDLLGDFGRFNWL 
GNFYIVLSYNLLFA1VTTLCLVRKFTSAVREE 
LFKALGLHKLHLPNTSRDSETAKPSVNGHQK 

AL 


410 


1760 


A 


3339 


127 


1433 


GSHRFSLASPLDPEVGPYCDTPTMRTLFNLL 

WLALACSPVHTTLSKSDAKKAASKTLLEKSQ 

FSDKPVQDRGLVVTDLKAESWLEHRSYCSA 

KARDRHFAGDVLGYVTPWNSHGYDVTKVFG 

SKFTQ1SPVWLQLKRRGREMFEVTGLHDVDQ 

GWMRAVRKHAKGL\P*CLGSCLRTGLTMISG/ 

YVLDSEDEIEELSKTVVQVAKNQHrlJUr Wt 

VWNQLLSQKRVGLIHMLTHLAEALHQARLL 

ALLV1PPAITPGTDQLGMFTHKEFEQLAPVLD 

GFSLMTYDYSTAHQPGPNAPLSWVRACVQV 

LDPKSKWRSKILLGLNFYGMDYATSKJDAREP 

VVGARYIQTLKDHRPRMVWDSQVSEHFFEY 

KKSRSGRHVVFYPTLKSLQVRLELARELGVG 

VSI WELGQGLD YF YDU * VGI AAS A VD VFFSK 

PWSE 


411 


1761 


A 


3342 


74 


2701 


" ' V ATRKLAKGFTQF AKMTEGTKKTSKKJ , "KJ ; FK. 
FKGFGSFSNLPRSFTLRRSSASISRQSHLEPDTF 
E ATQDDMVT VPKS PP A Y ARS SDM Y SHMGTM 
PRPSIKKAQNSQAARQAQEAGPKPNLVPGGV 
PDPPGLEAAKEVMVKATGPLEDTPAMEPNPS 
AVEVDPIRICPEVPTGDVEEERPPRDVHSERAA 
GEPEAGSD YVKF SKEKY1LD SSPEKLHKELEE 
ELKLSSTDLRSHAWYHGRIPREVSETLVQRN 
GDFLIRDSLTSLGDYVLTCRWRNQALHFKJN 
KVWKAGESYTHIQYLFEQESFDHVPALVRY 
HVGSRKAVSEQSGAIIYCPVNRTFPLRYLEAS 
YGLGQGS SKPASPVSPSGPKGSHMKRRSVTM 
TDGLTADKVTRSDGCPTSTSLPRPRDSIRSCA 
LSMDQIPDLHSPMSPISESPSSPAYSTVTRVHA 
APAAPSATALPASPVARRSSEPQLCPGSAPKT 
HGESDKGPHTSPSHTLGKASPSPSLSSYSDPDS 
GHYCQLQPPVRGSREWAATETSSQQARSYGE 
RLKELSENGAPEGDWGKTFTVP1VEVTSSFNP 
ATFQSLLIPRDNRPLEVGLLRKVKELLAEVDA 
RTLARHVTKVDCLV AKJLU V 1 Kbivi^ i l,ivhj v 
RWGMELLTLPHG\RKLRLDLLERFHTMSIML 
AVDILGCTGSAEERAALLHKTIQLAAELRGT 
MGNMFSFAA VMG ALDMAQ1 SRLEQTWVTLR 
QRHTEGAILYEKKEKPFLKSLNEGKEGPPLSN 
TTFPHVLPLITLLECDSAPPEGPEPWGSTEHGV 
EWLAHLEAARTVAHHGGLYHTNAEVKLQG 
FQARPELLEVFSTEFQMRLLWGSQGASSSQA 
RRYEKFDKVLTALSHKLEPAVRSSEL 


412 


1762 


A 


3347 


1 


89S 


" IDRAAECRTKPLPMAVSIRGNADS1VACLVLM 
VLYLIKKRLVACAAVFYGFAVHMKIYPETYI 
LPITLHLLPDRDNDKSLRQFRYTFQACL* ELL 
KRLCNRTALMFVAVAGLTFFALSFGFYYEYG 
WEFLEHTYFYHLTRRDIRHNFSPYFYMLYLT 
AESKWSF 5LGIAAFLPQLILLSAVSFAYYRDL 
VFCWFLHTSIFVTFNKVCTSQYFLWYLCLLPL 
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nucl- 
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seq- 
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seq- 
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hod 


SEQ 
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in 

IJSSN 
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nucleotide 
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ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D-Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
1=1 so leucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine 3 P=ProIine. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, W=Tryptophan. 
Y=Tyrosine, X^Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insenion 














VMPLVRMPWKRAVVLLMLWF1GQAMWLAP 
AYVLEFQGKNTFLFIWLACLFFLLINCSILIQII 
SHYKEEPLTERJKYD 


413 


1763 


A 


3361 


3 


474 


PIPVRWNSLEGRLLRGYEQHANDGKDYISRN 

*DLRSWTAADMAAQITKRKWEAEEFAEQIKA 

YLEGTC\rER/LRTHLENGKETLQLTEQSSQPTI 

PIVGIVAGLVLLGAVVTGAWSAVMCRKKNS 

GHFLPTDRVSYSEAASSDHAQGSDVSLTACK 

V 


414 


1764 


A 


3363 


148S 


453 


HQILELKKICILKTYNPDYDEDLVQEASSEDVL 

GVHMVDKDTERDIEMICRQLRRLRELHLYST 

WKKYQEAMKTSLGVPQRERDEGSLGKPLCP 

PEILSETLPGSVKKRVCFPSEDHLEEFIAEHLP 

EASNQSLLTVAHADAGTQTNGDLEDLEEHGP 

GQTVSEEATEVHMMEGDPDTLAELLIRDVLQ 

ELSSY'NGEEEXDPEEVKTSLGVPQRGDLEDLE 

EHXTGQT V SEE ATG VHMMQ VDPATLAKSDL 

EDLEEHVPEQTVSEEATGVHMMQVDPATLA 

KQLEDSTITGSHQQMSASPSSAPAEEATEKTK 

VEEEVKTRKPKKKTRKPSKKSRWNVLKCWD 

IFNIF 


415 


1765 


A 


3369 


431 


315 


IPVVS WVGRLS VRKMSILF*LTYN YNA1LNKTP 
PSFSPSL 


416 


1766 


A 


3373 


42 


651 


RQEKMGLGE1GASGVLRSMLICER1CKQNMKG 
NGNVTLTPLLPAVQCGCHLQPAGRSPLPSSHS 
APGLCSPLHPLQPQQEASTCPSGTLQGREKAA 
PGQGRPLCSLWAGGAGA\PGERGAEGRGPSD 
QAPDPKSGPWLFPPGLGAPAEVRLHNVPHNL 
RRPPLP*AJRGK*PPNSGCPWSEGRAKQPLSCG 
PKPQCSLPSQVPGDTH 


417 


1767 


A 


3382 


2 


2061 


E AQDPRACGPD AG GRF AARD APGN SLRPPP S 

SPP/GWPGQLRLLPRVPGSELRCGKPERGRLP 

ASPPGICIRGWPPGISICRPGLGGRSFPPGFAPRT 

WRPEARGPSVQSLPPIFSPQSAQTTAR*RPGAP 

KNAGRCGGA\RGPRLSLGPPPGPPPAPALPAR 

ASAGAGAAAAALAVGGVRGAGGARGTGGY 

GHCSGR/PTGRTGPGPQGPGPPMPARPR* AS\S 

TRGSRRGPGSRPARAAAAPRAGDHGRRPVRV 

HLRQHTA V* EPRLGD ATAPPG G AAGPG AP AP 

RXGPGWDCALLPSPGPRSPRAVGCAEPEIWDP 

SPRRGTSPVPSVRSLRSEPANPRLGLPALLNSY 

PLKGPGLPPPWGPRTQTGHVTITVQPSGSCIEH 

SKSLD/RGPWGAPPWGPSSSGJLCSPKLATAGP 

PQSWGLCQIGRRRGLGGPGLKRGET/GLL*GC 

SMDHANRTKGPGVPTSNRCFSHIPG\GDGCSD 

HSSCEGHPDLHAGREMPAAPGLSELERVRFT 

VGCGGLASGISSASVSGLSPNRAGGPGQGDW 

EMYPVSWQTQESGGQG/SPKTGR*VGMLQA 

GAGSLQGGTGDGVWGLWEDGP/RG*DSPLPS 

GTGTEP*TPTTSIPFFPQPSGVYPSRATLLPMPS 

Y*ALGPSANKSEKPLLSFLYRGLCCRISLQLA 

KGIGQLSEIPLLNVETAFWSMWVTYFRK 


418 


1768 


A 


3398 


304 


2121 


EEEEEEEDEDDDDNNEEEEFECYPPGMKVQV 
RYGRGKNQKMYEASUCDSDVEGGEVLYLVH 
YCGWNVRYDEWLICADKIVRPADKNVPKJKH 
RKKIKNKLDKEKDKDEKYSPKNCICPPAJLGPN 
PPFQ'nMPlSWKWYPKLDLTDAKNSDT.AHIKSI 
EITSILNGLQASESSAEDSEQEDERGAQDMDN 
NGKEESKIDHLTNNRNDLISKEEQNSSSLLEE 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 
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09/496 
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Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine C^Cysteine, 
D^-Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
IHteoleucine, K=Lysine, L^Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine r V=Valine, \V=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














NKVHADLVISKPVSKSPERLRKDIEVLSEDTD 
YEEDEVTKKRKDVKKDTTDKSSKPQ1ICRGKR 
RYCNTEECLKTGSPGKKEEKAKNKESLCMEN 
S SNS S SDEDEEETKAKMTPTKK YNGLEEKRK 
SLRTTGF Y SGF SEV AEKJUKLLNNSDERLQNS 
RAKDRJCDVWSSIQGQWPKKTLKELFSDSDTE 
AAASPPHPAPEEGVAEESLQTVAEEESCSPSV 
ELEKPPP VN VD SKPIEEKT VEVNDRKAhr Pb b 
GSNFSA*IPLPYLHLNRLHQSL*QKGSRQQSS 
VTVSEPLAPNQEEVRSIKSETDSTIEVDSVAGE 
LQDLQSERE* LASRF* CQCELKQ* * SARTRTS* 
KSL YRSEKSERCSGRRKFlKJ<LAJbKJ<LP* bN bGK 
QQKEGK 


419 


1769 


A 


3399 


206 


463 


QRECLSIHIGQAGIQIGDACWELYCLEHGIQP 
NGVVLDTQQDQLENAKMEHTNASFDTFFCE 
TRAGKHVPRALFVDLEPTVIDGIR 


420 


1770 


A 


3408 


1010 


685 


RRLSFFF* I WSSVLVTQARVQ WRDLGSPQPLP 
PGFKRFSCLSLPSSWDYRHPSPRPVNF/HVFLV 
VMGFHHVGQAGLELLTSGDLPALASQSARIT 
GVNHCAQPRGHFFI 


421 


1771 


A 


3409 


355 


1326 


ADSNLIESCWQELGLGPWGGDWRVEQVGAS 

ASLRFPREVCS1RFLFTAVSLLSLFLSAFWLGL 

LYLVSPLENEPKEMLTLSEYHERVRSQGQQL 

QQLQAELDKLHKEVSTVRAANSERVAKLVF 

QRLNEDFVRKPDYALSSVGASIDLQKTSFTDY 

ADRNTAYFWNRFSFVvTsfYARPPTVILEPHVFP 

GNCWAFEGDQGQVVTQLPGRVQLSDITLQHP 

PPSVEHTGGANSAPRDFAVFFLLSFFTHQGLQ 

VYDETE V SLGKFTFD VEKSEIQTFHLQNDPPA 

AFPK VKJQILSN WGHPRFTCL YRVRAHGVRT 

SEGAEGSAQGPH 


422 


1772 


A 


3412 


2 


421 


EFDAQPSIGALWFKRP*ATTGSDPGPKRGMN 

YLVSCSMRSPESGKGEPGTARDYTPMGRPPP 

PVPSVSPGPLPGSLAIAPHSPEPHPWEQQPPRG 

QARSPPGGWLGSAT/RVRRPHNHP/RGH/HSP 

VDTAGAPASPGPDVCE 


423 


1773 


A 


3420 


91 


706 


DAQRAIYSSVGPAVSLRQRQQDGAVKESGR/ 
RGGVRSFSRAAAAMAPTKVGDAJPAVEVFEG 
EPGNKVNLAELFKGKKGVLFGVPGAFTPGCS 
KTHLPGFVEQAEALKAKGVQVVACLSVNDA 
FVTGEWGRAHKAEGKVRLLADPTGAFGKET 
DLLLDDSLVSIFGNRRLKRFSMVVQDGIVKA 
LNVEPDGTGLTCSLAPNIISQL 


424 


1774 


A 


3421 


4 


7688 


RQVTRVGTRVLGSTTAAVFLSVEDDNDNAPQ 

FSEKRYVVQVRED\TPGAPVLRVTASDRDKG 

SNAVVHYSIMSGNARGQFYLDAQTGALDVV 

SPLDYETTICEYTLRVRAQDGGRPPLSNVSGL 

VTVQVLDINDNAPIFVSTPFQATVLESVPLGY 

LVLHVQAIDADAGDNARLEYRLAGVGHDFP 

FTINNGTGWISVAAELDREEVDFYSFGVEAR 

Unb 1 rAL 1 AoAo V b V i AJL.U V In L/ IN IN r i r I yrt< 

YTVRLNEDAAVGTSVVTVSAVDRDAHSV1TY 

QITSGNTRNRFSITSQSGGGLVSLALPLDYKLE 

RQYVLAVTASDGTRQDTAQIWNVTDANTH 

RPVFQSSHYTVNVNEDRPAGTTVVLISATDE 

DTGENARITYFMEDSIPQFRIDADTGAVTTQA 

ELDYEDQVSYTLAITARDNGIPQKSDTTYLEI 

LVNDVNDNAPQFLRDSYQGSVYEDVPPFTSV 

LQISATDRDSGLNGRVFYTFQGGDDGDGDFI 
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SEQ ED 
In(J: oi 
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seq- 
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SEQ ID 
NU. OI 
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seq- 
uence 


Met 
nod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alaninc C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G^Glycine, H=Histidine, 
I=lsoleucine, K=Lysine, L^Leucine, 
M=Methionine, N=Asparagine, P=Proline s 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W^Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/-possible nucleotide deletion, V=possible 
nucleotide insertion 














VESTSG1VRTLRRLDRENVAQYVLRAYAVDK 

GMPPARTPMEVTVTVLDVNDNPPVFEQDEFD 

VFVEENSPIGLAVARVTATDPDEGTNAQIMY 

QIVEGNIPEVFQLDIFSGELTALVDLDYEDRPE 

YVLVIQATSAPI..VSRATVHVRLLDRNDNPPV 

LGNFEELFNNYVTNRSSSFPGGAIGRVPAHDP 

DISDSLTYSFERGNELSLVLLNASTGELKLSR 

ALDNNRPLEAIMSVLVSDGVHSVTAQCALRV 

TIITDEMLTHSITLRLEDMSPERFLSPLLGLFIQ 

AVAATLATPPDHVVVFNVQRDTDAPGGHTLN 

VSLSVGQPPGPGGGPPFLPSEDLQERLYLNRS 

LLTAISAQRVLPFDDNICLREPCENYMRCVSV 

LRFDSSAPFIASSSVLFRPIHPVGGLRCRCPPGF 

TGDYCETEVDLCYSRPCGPHGRCRSREGGYT 

CLCRDGYTGEHCEVSARSGRCTPGVCKNGGT 

CVNLLVGGFKCDCPSGDFEKPYCQVTTRSFP 

AHSFITFRGLRQRFHFTLALSFATKERDGLLL 

YNGRFNEKHDFVALEVIQEQVQLTFSAGEST 

TTVSPFVPGGVSDGQWHTVQLKYYNKPLLG 

QTGLPQGPSEQKVAVVTVDGCDTGVALRFGS 

VLGNYSCAANQGTQGGSKKSLDLTGPLLLGG 

VPDLPESFPVRMRQFVGCMRNLQVDSRHIDM 

ADFIANNGTVPGCPAKKNVCDSKTCHNGGTC 

VNQWDAFSCECPLGFGGKSCAQEMANPQHF 

LGSSLVAWHGLSLPISQPWYLSLMFRTRQAD 

GVLLQAITRGRSTITLQLREGHVMLSVEGTGL 

QASSLRLEPGRANDGDWHHAQLALGAIGGP 

GHAILSFDYGQQRAEGNLGPRJLHGLHLSNITV 

GGIPGPAGGVARGFRGCLQGVRVSDTPEGVN 

SLDPSHGESINVEQGCSLPDPCDSNPCPANSY 

CSNDWDSYSCSCDPGYYGDNCTNVCDLNPC 

EHQSVCTRKPSAPHGYTCECPPNYLGPYCET 

RIDQPCPRGWWGHPTCGPCNCDVSKGFDPDC 

NKTSGECHCKENHYRPPGSPTCLLCDCYPTG 

SLSRVCDPEDGQCPCKPGVIGRQCDRCDNPF 

AEVTTNGCEVNYDSCPRAIEAGIWWPRTRFG 

LPAAAPCPKGSFGTAVRHCDEHRGWLPPNLF 

NCTSITFSELKGFAERLQRNESGLDSGRSQQL 

ALLLRNATQHTAGYFGSDVKVAYQLATRLL i 

AHESTQRGFGLSATQDVHFTENLLRVGSALL 

DTANKRHWELIQQTEGGTAWLLQHYEAYAS 

ALAQNMRI-ITYLSPFTJVTPNIVISVVRLDKGN 

FAGAKLPRYEALRGEQPPDLETTVILPESVFR 

ETPPVVRPAGPGEAQEPEELARRQRRHPELSQ 

GEAVASVIIYRTLAGLLPHNYDPDKRSLRVPK 

RPIINTPVVSISVHDDEELLPRALDKJPVTVQFR 

LLETEERTKPICVFWTMHSILVSGTGGWSARGC 

EWFRKESm^SCQCNHMTSFAVLMDVSRRE 

NGEILPLKTLTYVALGVTLAALLLTFFFLTLL 

RILRSNQHGIRRNLTAALGLAQLVFLLGINQA 

DLPFACT VI AILLHFL YLCTFS W ALLEALHL Y 

p At TF\/Drk\/MT/ r JDN/fDT7Wk/ii n\\in\n> a cttv-* 
jK/VL 1 Cj vl\U vrv I KJriVlKr i 1NIL.KJ W O VJrAri I <j 

LAVGLDPEGYGNPDFCWLSIYDTLIWSFAGP 

VAFAVSMSVFLYILAARASCAAQRQGFEKKG 

P VSGLQPS FA VLLLL S ATWLLALLS VN SDTLL 

FHYLFATCNCIQGPFIFLSYVVLSKEVRKALK 

LACSRKPSPDPALTTKSTLTSSYNCPSPYADG 

RLYQP\YGDSAGSLHSTSRSGKSQPSYIPFLLR 

EESALNPGNQGPPGLGGIPGR/I.CFLGRFKDQQ 

H\DS*TRDFDSDLSLEDDQSGSYASTHSSDSEE 
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nucleotide 
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to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
T=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine 5 P-Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V^Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion.. \=possible 
nucleotide insertion 














EEEEEEEEAAFPGEQGWDSLLGPGAERLPLHS 
TPKDGGPGPGKAPWPGDFGTTAKESSGNGAP 
EERLRENGD ALSREG SLGPLPG SSAQPHKGIL 
KKKCLPTISEKSSLLRLPLEQCTGSSRGSSASE 
GSRGGPPSRPPPRQSLQEQLNGVMP1AMSIKA 
GTVDEDSSGSEFLFFNFLH 


425 


1775 


A 


3429 


155 


1417 


GEPAVQSCDCGCTQRSCPWLLVAPGLLSSSSS 

RAASVREAEDAPLQPASIHPVSQGSRGPEGSL 

GSAECLPGDPLGARRATRAHSPYPGPPPSLPA 

AGTAVKRGLQPG* G A/GATSTPGTG A ATGGI , 

CGPAWAAPSAVGPCCCCPS1STTPSQMRSARP 

SLGCLPSWAS\PGTEHPPGPQGPGPS*DLCSV* 

KREFQRGPWAGMVELHRISAADPARAPGPDS 

NLQSALQQPATGCSEPAAVYSPPIGLWGA**P 

EYG*PQHSLPG*TAPADR*P\AGIKDRVYSNSI 

YELLENGQRAGTCVLEYATPLQTLFAMSQYS 

QAGFSRJEDRLEQAKLFCRTLEDILADAPESQN 

NCRLIAYQEPADDSSFSLSQEVLRHLRQEEKE 

EVTVGSLKTSAVPSTSTMSQEPELLISGMEKP 

LPLRTDFS 


426 


1776 


A 


3431 


1662 


369 


AIWWLSWLQHDLLPTPTQVAIDFTASNGDPR 

SSQSLHCLSPRQPNHYLQALRAVGGICQDYD/ 

SVGESGAGGNRQGGLAQRIPQLFLLPSDKRFP 

AFGFGARIPPNFEVG*MRGKEGDGGRVSQAE 

KAGPHCSRLALTG\SHDFAINFDPENPECEGK 

RGDFHLPRLPADTLIITGAQTPLPRAQLPVPST 

HPRPVFINEISGVIASYRRCLPQIQLYGPTNVAP 

IINRV AEP AQREQ STG Q ATK Y S VLL VLTDG V 

VSDMAETRTAIVRASRLPMSIIIVGVGNADFS 

DMRLLDGDDGPLRCPRGVPAARDIVQFVPFR 

DFKDVSPPGPFRLKDSSASHPPKSDLRLPPFD 

VLLRTREPSWPP*SPTSPSDDPASPTLPLTPNHI 

TVPTLVAAPSALAKCVLAEVPRQVVEYYASQ 

GISPGAPRPCTLATTPSPSP 


427 


1777 


A 


3446 


79 


9748 


" GCQSCWPAWPRLRRRGPASAGARLGRKAPW 
GLPGRVQDGRPLRFCFYLRPRAPFIAPVLSGA 
ASRPEASGDCRAGRETAMATLEICLMKAFESL 

KSFQQQQQQQQQQQQQQQQQQQQQQQPPPP 

PPPPPPPQLPQPPPQAQPLLPQPQPPPPPPPPPP 

GPAVAEEPLHRPKKELSATKKDRVNHCLTIC 

ENIVAQSVRNSPEFQKLLG1AMELFLLCSDDA 

ESDVRMVADECLNKVIKALMDSNLPRLQLEL 

YKJEIKKNGAPRSLRAALWRFAELAHLVRPQK 

CRPYLVNLLPCLTRTSKRPEESVQETLAAAVP 

KIMASFGNF ANDNEIKVLLKAFI ANLKS SSPTI 

RRTAAGSAVSICQHSRRTQYFYSWLLNVLLG 

LLVPVEDEHSTLLDLGVLLTLRYLVPLLQQQV 

KDTSLKGSFGVTRKEMEVSPSAEQLVQVYEL 

TLHHTQHQDHNVVTGALELL QQLFRTPPPEL 

LQTLTAVGGIGQLTAAKEESGGRSRSGSIVELI 

AGGGSSCSPVLSRKQKGKVLLGEEEALEDDS 

ESRSDVSSSALTASVKDEISGELAASSGVSTPG 

S AGHDIITEQPRSQH TLQ ADS VDL ASCDLTSS 

ATDGDEEDILSHSSSQVSAVPSDPAMDLNDG 

TQASSPISDSSQTTTEGPDSAVTPSDSSEIVLD 

GTDNQYLGLQIGQPQDEDEEATGILPDEASEA 

FRNSSMALQQAHLLKNMSHCRQPSDSSVDKF 

VLRDEATEPGDQENKPCRIKGDIGQSTDDDS 

APLVHC\ r RLLSASFLLTGGKNVLVPDRDVRV 

SVKALALSCVGAAVALHPESFFSKLYKWLD 
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sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenylaIanine, G=Glycine, H=Histidine, 
1=1 soieu cine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine a P=Proline, 
Q=Glutaminc, R— 4rginine ? S^Serine, 
T=Threonine, V=Valine, W=Tryptophan. 
Y-Tyrosine, X=Unknown, *=Stop codon, 
/=possib!e nucleotide deletion. \=possible 
nucleotide insertion 














TTEYPEEQYVSDILNYrDHGDPQVRGATAlLC 

GTLICS1LSRSRFHVGDWMGTIRTLTGNTFSL 

ADCIPLLRKTLIsX)ESSVTCKLACTAVRNCVM 

SLCSSSYSELGLQLIIDVLTLRNSSYWLVRTEL 

LETL AETDFRJ .VSFLEAKAENLHRG AHHYTGL 

LKJ^QER VLNNV VIHLLGDEDPR VRH V AAASL 

IRLVPKLFYKCDQGQADPVVAVARDQSSVYL 

KLLMHETQPPSIIFSVSTITRJYRGYNLLPSITD 

VTMENNLSRVIAAVSHELITSTTRALTFGCCE 

ALCLLSTAFPVCIWSLGWHCGVPPLSASDESR 

KSCTVGMATMILTLLSSAWFPLDLSAHQDAL 

ILAGNLLAASAPKSLRSSWASEEEANPAATK 

QEEVWPALGDRALVPMVEQLFSHLLKVINIC 

AHVLDDVAPGPAIKAALPSLTNPPSLSPIRRK 

GKEKEPGEQASVPLSPKKGSEASAASRQSDTS 

GPVTTSKSSSLGSFYHLPSYLKLHDVLKATHA 

NYKVTLDLQNSTEKFGGFLRSALDVLSQILEL 

ATLQD1GKCVEEILGYLKSCFSREPMMATVC 

VQQLLKTLFGTNLASQFDGLSSNPSKSQGRA 

QRLGSSSVRPGLYHYCFMAPYTHFTQALADA 

SLRNMVQAEQENDTSGWFDVLQKVSTQLKT 

NLTSVTKNRADKNAIHNHIRLFEPLVIKALKQ 

YTTTTC VQLQKQ VLDLLA QL VQLR VN YCLL 

DSDQVFIGFVLKQFEYffiVGQFRESEAUPNlFF 

FLVLLSYERYHSKQIIGIPKIIQLCDG1MASGR 

KA.VTHAIPALQPIVHDLFVLRGTNKADAGKE 

LETQKEVWSMLLRLIQYHQVLEMFILVLQQ 

CHKENEDKWKRLSRQIADI1LPMLAKQQMHI 

DSHEALGVLNTLFEILAPSSLRPVDMLLRSMF 

VTPNTMASVSTVQLWISGILAILRVLISQSTED 

IVLSRIQEESFSPYL1SCTVINRLRDGDSTSTLE 

EHSEGKQIKNLPEETFSRFLLQLVGILLEDIVT 

KQLKVEMSEQQHTF YCQELGTLLMCLU IIFKS 

GMFRRJT AAATRLFRS DGCGGSF YTLD SLNLR 

ARSMITTHPALVLLWCQILLLVNHTDYRWW 

AEVQQTPKRHSLSSTK1XSPQMSGEEEDSDLA 

AKLGMCNREIVRRGAL1LFCDYVCQNLHDSE 

HLTWLIVNHIQDUSLSHEPPVQDFISAVHRNS 

AASGLFIQAIQSRCENLSTPTMLKKTLQCLEGI 

FILSQSGAVLTLYVDRLLCTPFRVLARMVDIL 

ACRRVEMLLAANLQS SMAQLPMEELNRIQE Y 

LQ S SGLAQR HQRLYSLLDRFRLSTMQD SL SPS 

PPVSSHPLDGDGHVSLETVSPDKDWYVHLVK 

SQCWTRSDSALLEGAELVNRIPAEDMNAFM 

MNSEFNLSLLAPCLSLGMSEISGGQKSALFEA 

AREVTLARVSGTVQQLPAVHHVFQPELPAEP 

AAYWSKLNDLFGDAALYQSLPTLAR.A.LAQY 

LVVVSKLPSHLHLPPEKEKDIVKFWATLEAL 

S WHL1HEQIPLSEDLQAGLDCCCLALQLPGL 

WSVVSSTEFVTHACSL1YCVHFILEAVAVQPG 

EQLLSPERRTNTPKAI SEEEEE VDPNTQNPK Yl 

TAACEMVAEMVESLQSVLALGHKRNSGVPA 

FLTPLLRNIIISLARLPL VNS YTR VPPI ,V WKLG 

WSPKPGGDFGTAFPEIPVEFLQEKEVFICEFIYR 

INTLGWTSRTQFEETWATLLGVLVTQPLVME 

QEESPPEEDTERTQINVLAVQAITSLVLSAMT I 

VPVAGNPAVSCLEQQPRNKPLKALDTRFGRK 

LSIIRGI VEQE1QAMVSKRENI ATHHL YQA WD 

PVPSLSPATTGALISHEKXLLQINPERELGSMS 

YKJ..GQVSIHSVWLGNSITPLREEEWDEEEEEE 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of, 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F^Phenylalanine, G=Glycine, H=Histidine, 
l=Iso leucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Threonine ? V* Valine, W-^Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














ADAPAPSSPPTSPVNSRKHRAGVDIHSCSQFL 

LELYSRWILPSSSARRTPAILISEVVRSLLVVS 

DLFTERNQFELMYVTLTELRRVHPSEDEILAQ 

YT .VP ATCK AAAVT .GMDKAVAEPVSRLLESTL 

RSSHLPSRVGALHGILYVLECDLLDDTAKQL1 

PVISDYLLSNLKG1AHCVNIHSQQHVLVMCAT 

AFYLIENYPLDVGPEFSASIIQMCGVMLSGSE 

ESTPSIIYHCALRGLERLLLSEQLSRLDAESLV 

KLSVDRVNVHSPHRAMAALGLMLTCMYTG 

KEKVSPGRTSDPNPAAPDSESVTVAMERVSVL 

FDR1RKGFPCEARVVAR1LPQFLDDFFPPQDIM 

NKVIGEFLSNQQPYPQFMATWYKVFQTLHS 

TGQSSMVRDWVMLSLSNFTQRAPVAMATWS 

LSCFFVSASTSPWVAAILPHVISRMGKLEQVD 

VNLFCLVATDFYRHQ1EEELDRRAFQSVLEV 

VAAPGSPYHRLLTCLRNVHKVTTC 


428 


1778 


A 


3449 


3 


430 


NSRP S PS AAL VE VLLRSG STFPHT V S GG W AA 
WGPWSSCSRDCELGFRVRKRTCTNPEPRNGG 
LPCVGDAAEYQDCNPQACPVRGAWSCWTS 
WSPCSASCGGGHYQRTRSCTSPAPSPGEDICL 
GI ,HTEEALC ATQACPEGWS 


429 


1779 


A 


3464 


583 


3 


DALDRRYLERCHPAAGGWVGEGE*ALCQKT/ 

RFSGVLEPPLPSLKDGGRFPAWT*RSCSKSLR 

AAFTSQFFPSRRSRASPGSAP\GNGQNLTEQHP 

CPGSCDPQVLSASWM*VEHRSKFRPPP*NSTI 

PPES/RS* QGGTVQTGQHSSGREAGS WRARGR 

NAGRR*KGGGKJGTKQGAVRARKECRGEMA 

SGETDSE 


430 


1780 


A 


3473 


2802 


270 


FRMRIFLHCPWNQQMWKJWNLLETSLESCKA 

HLSIQKLLKER\Q\QLPVFKHRDS1VETLKRHR 

VVVVAGETvGSGKSTQVPHFLLEDLLLNEWE 

ASKCNIVCTQPRRISAVSLANRVCDELGCENG 

PGGRNSLCGYQIRMESRACESTRLLYCTTGV 

LLRKLQEDGLLSNVS/HMFIVDEV\HER\SVQS 

DFLLIILKEILQKRSDLHLILMSATVDSEKFST 

Y FTHCPILRI SGRS YP VE VFHLEDI IEETG F VLE 

KDSEYCQKFLEEEEEVTINVTSKAGGIKKYQE 

YIPVQTGAHADLNPFYQKYSSRTQHAILYMN 

PHKINLDLILELLAYLDKSPQFRNIEGAVLIFL 

PGLAH1QQLYDLLSNDRRFYSERYKVIALHSI 

LSTQDQAAAFTLPPPGVRKIVLATNIAETG1TI 

PDVVFVIDTGRTKENKYHESSQMSSLVETFVS 

KASALQRQGRAGRVRDGFCFRMYTRERFEG 

FMDYSVPEILRWLEELCLHIMKCNLGSPEDF 

LSKALDPPQLQVISNAMNLLRKIGACELNEPK 

LTPLGQHLAALPVNVKIGKMLIFGAIFGCLDP 

VATLAAVMTEKSPFTTPIGRKDEADLAKSAL 

AN4ADSDHLTIYNAYLGWKKARQEGGYRSEI 

TYCRRNFLNRTSLLTLEDVKQELIKLVKAAGF 

S SSTTSTS WEGNRA SQTLSFQEI ALLKAVLVA 

GLYDNVGKI1YTKSVDVTEKLACIVETAQGK 

AU V rlr jo v n aulv^ l nu w ll i ycrmv i ru\ v i 

LRETTLITPFPVLLFGGDIEVQHRERLLSIDGW 

IYFQAPVKIAVIFKQLRVLIDSVLRKKLENPK 

MSLENDKILQIITELIKTENN 


431 


1781 


A 


3474 


1 


441 


FRPAPGH VQP* GGSS AAAGGGLLSHPRPCQQ 

PCPPAPAPSRPRSLGSLGQRVPAALATAAQEL 

PATLGGDGGKPALTAGEAALPGLHRSGVPAA 

AARC*PCT/SRPT*STLSPTQAAWWCRPSRRQ 

QRGEASTGGASGRRCGSCFQV 
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SEQJD 


SEQID 


Met 


SEQ 


Predicted 


Predicted end 


Amino acid sequence (A=AJajiine C=Cysleine, 


NO: of 


NO: of 


hod 


ID NO: 


beginning 


nucleotide 


D=Aspartic Acid, E=Glutamic Acid, 


nucl- 


peptide 




in 


nucleotide 


location 


F=Phenylalanine. G=Glycine, H=Histidine, 


eotide 


seq- 




USSN 


location 


corresponding 


l=Isoleucine, K=Lysine, L=Leucine, 


seq- 


uence 




09/496 


correspondi 


to last amino 


M=Methionine, N=Asparagine, P=Proline, 


uence 






914 


ng to first 
amino acid 
residue of 
peptide 
sequence 


acid residue 
of peptide 
sequence 


Q=Glutamine, R=Argininc, S=Serine, 
T-=Threonine ? V-Valine, W^-Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


432 


1782 


A 


3478 


416 


23 


QLRRLTLPNFKTYA'SSMIE1AWH**KNMQID 
QWFRRESPEIDLCKYS*LSFDKEAKAIK/WKJE 
CSLFNKWC/YKN WM/LH VQKKRI * VQTLHPS 
QKLK\SKW1KDLNVECRITKLLDQEYPGDLGY 
SRALNSGSR 


433 


1783 


A 


3504 


1876 


552 


CLAPCSPQPEKNGMQPLLLLLPPLLYQQLLHS 

SLGAPGESTLLVRTSKLLVGLGLQLLVWLLL 

QTRSLLALQLHLTSSAPLLAAPTAVCSCSRCS 

APR SRCV ARPA ARTG1 .PTPAP A SSPAPAASPA 

PAASPAPAESTAXPQPLILLPKP/PPAPGAPPPRP 

GAPPPRPAASPSPAASPAPPAASPVLTASPPLP 

AASPSPAASPAPPAASPVLTASPPLPAASPSPA 

ASPAPPAASPVLTASPPLPAASPALAASPVHT 

ASPPVHVASPPVHTASPPVHVASPPVHTASPP 

VHVASPPVHTASPHVHVASPPVHVASPPVHV 

ASPP VHTASPP VH V ASPP VH1ASPHVH VASPP 

VHTASPPVHVASPPVHVASPPVHVAYPPVHV 

ASPPVHVASPPVHVASPPVSCSGDSTSDCFPP 

QPG A VFPHSLAPSLGG WSHLVAALP 


434 


1784 


A 


3516 


142 


590 


GGVNRPRSETEQVKTPVLISSWDYRHPPPRPA 
SFFVFLV*TGF\TALARMVLISWPCDLPTSASQ 
S AG ITG VRHH A\RLL YFEQESHS VTQAG W\VQ 
WHNLGSLQPLSLEDRLSPGVLGCSALCRSGV 
RTICFGINMVTSRERGTTRLPKEG 


435 


1785 


A 


3529 


1 


3161 


MSLVRAALEALDELDLFGVKGGPQSVIHVLA 

DEVQHCQSILNSLLPRASTSKEVDASLLSVVS 

FPAPAVEDSQLVELTKQEIITKEQGRYGCCRF 

LRDGYKTPKEDPNRLYY/ENPAELKLFENIEC 

EWPLFWTYFILDGVFSGNAEQVQEYKEALEA 

VLIKG KNG VPLLPEL YS VPPDR VDEE YQNPHT 

VDRVPMGKLPHMWGQSLYILGSLMAEGFLA 

PGEIDPLNRRFSTVPKPDVVVQVYPSLPHGCS 

SKSPSHQCTHSIRTTRKITAPVSILAETEEIKTIL 

KDKGI YVETIAEVYPIRVQP ARIL SHI YSSLEIF 

LPFLNSVSGCNNRMKLSGRPYRHMGVLGTSK 

LYDIRICTIFTFTPQFIDQQOFYLALDNKMIVE 

MLRTDLSYLCSRWRMTGQPTITFPISHSMLDE 

DGTSLNSSILAALRKMQDGYFGGARVQTGKL 

SEFLTTSCCTHLSFMDPGPEGKLYSEDYDDN 

YDYLESGNWMNDYDSTSHARCGDEVARYL 

DHLLAHTAPHPKLAPTSQKGGLDRFQAAVQT 

TCDLMSLVTKAKELHVQNVHMYLPTICLFOA 

SRPSFNLLDSPHPRQENQVPSVRVEIHLPRDQ 

SGE VDFKAt VLQLKETSSLQEQAD1I .YMLYT 

MKGPD\\ / NTELYNERSATVRELLTELYGKVG 

EIRHWGLIRY1SGILRKKVEALDEACTDLLSH 

QKHLTVGLPPEPREKTISAPLPYEALTQLIDEA 

SEGDMSI SILTQEIMV YL AM YMRTQPGLFAE 

MFRLRIGLIIQVMATELAHSLRCSAEEATEGL 

MNLSPSAMKNL1.HHTLSGKEFGVERSVRPTD 

SNVSPAISIHEIGAVGATKTERTGIMQLKSEIK 

APDPTCXylTDCCr'CUDC A VH AAC CI/ fiO n AM / 

C^bru 1 blvj 1 FbbObr FisA. YDQ^obKlJbRQGQW 
QRRRRLDGALNRVPVGFYQKVWKVLQKCH 
GLS VEGFVLPS STTREMTPGEIKFS VHVES VL 
NRVPQPE YRQLL VE AIL\ VLTMLADIEIVHSIG S 
JIAVEKI VHIAND1 .FLQEQKTLGADDTMLAKD 
PASGICTLLYDSAPSGRFGTMTYLSKAAATY 
VQEFLPHSICAMQ 


436 


1786 




3546 


73 


393 


CP*LTWELLEVKKAEVLQDSLDGRYSTPSSCL 
EQPDSCRPYGRSFYALEEKHVIFSLDVGETDN 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 



437 



SEQ ID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



1787 



438 



1788 



439 



440 



1789 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 



3554 



5157 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



2939 



Amino acid sequence (A^Alanine OCysleine, 
D=Aspartic Acid, E=Glutamic Acid, 
^Phenylalanine, G=Glycine, H=Histidine ; 
I=lsoleucine, K=Lysine, L=Leucme s 
M=Methionine, N=Asparagine, P-Proline, 
Q=Glutaminc, R=Argininc, S=Serinc, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion. \=possible 

nucleotide insertion 

KGKGKTIRGI*TFKGRKGG'fYQREHDANPLA 



3563 



130 



3565 



446 



1790 



441 



1791 



527 



PXS ARSCWMRKG . __ „ 

AVRAEPGLEELSSGLRAHSPSATTVCEPEAQG 



S ASGCRYAAHPHWGLGGAAAAGGSWEPQPP 
RPVCEPAGRGKPHPPAAPRSPLLPGSRRRPHA 
AQPGARARTSPPPASARNN1AARPAATLAWSL 
LLLSSALLREGCRARFVAERDSEDDGEEPVVF 
PESPLQSPTVLVAVLARNAAHTLPHFLGCLER 
LDYPKSRMAIWAATDHNVDNTTEIFREWLK 
NVQRLYHYVEWRPMDEPESYPDE1GPKHWP 
TSRFAHVMKLRQAALRTAREKWSDYILFIDV 
DNFLTNPQTLNLLIAENKTIVAPMLESRGLYS 
NFWCGITPKGFYKRTPDYWQIREWKRTGCFP 
VPMVHSTFLIDLRKEASDKLTFYPPHQDYTW 
TFDDTIVFAFSSRQAGIQMYLCNREHYGYLPIP 
LKPHQTLQEDIENL1HVQIEAMIDRPPMEPSQ 
YVSVVPKYPDKMGFDEIFMINLKRRKGQGGD 
RWLRTLYEQEIEVKIVEAVDGKALNTSQLKA 
LNIEMLPGYRDPYSSRPLTRGEIGCFLSHYSV 
WKEVIDRELEKTLVIEDDVRFEHQFKKKLMK 
LMDNIDQAQLDWELIYIGRKRMQVKEPEKA 
VPNVANLVEADYSYWTLGYVISLEGAQKLV 
GANPFGKMLPVDEFLPVMYNKHPVAEYKEY 
YESRDLKAFSAEPLLIYPTHYTGQPGYLSDTE 
TSTIWDNETVATDWDRTHAWKSRKQSRIYSN 
AKNTEALPPPTSLDTVPSRDEL 



1834 



3568 



3569 



350 



IFFN S SSLFCRVFCLFLR WSFTL V AQ ARV Q* C 
NLSSLQPLPPGFK*FSCLSPPRS*DYRRPPPRPA 
NFLYF* * RQGFTVLGQAGLELLT/S/GDPPTSA 
SQSAGITGVSHRAWPVHA1STH1SLVKTRPSLT 

TLG 



LLQPAMRKSPGLSDCLWAWILLLSTLTGRSY 
GQPSLQDELKDNTTVFTRILDRLLDGYDNRL 
RPGLGERVTEVKTDIFVTSFGPVSDHDMEYTI 
D VFFRQ S WKDERLKFKGPMT VLRLNNLM AS 
KIWTPDTFFHNGKICSVAHNMTMPNKLLRITE 
DGTLLYTMRLTVR\AECPMAFGRDFPM\D\AH 
ACPLKFGSYAYTRAEWYEWTREPARSVVV 
AEDGSRLNQYDLLGQTVDSGIVQSSTGEYVV 
MTTHFHLKRKIGYFV1QTYLPCIMTVILSQVSF 
WLNRESVPARTVFGVTT VLTMTTLSIS ARN SL 
PfCV AYATAMD WFIA VC YAF VFS ALIEFAT VN 
YFTKRG YAWDGKS WPEKPKKVKDPL IKKN 
NTYAPTATSYTPNLARGDPGLATIAKSATIEP 
KEVKPETKPPEPKKTFNSVSKIDRLSRIAFPLL 
FGIFNL V YWATYLNREPQLKAPTPHQ 



1751 



STSSCFPAAAAA1MREIVHLQAGQCGNQIGAK 
FWEVISDEHGIDPTGTYHGDSDLQLERINVYY 
NEATGEAPVPSPTALRGPRGPCLG*RPPVPAG 
GKYVPRAVLVDMEPGTMDSV 



FVAVAGAVSGEPLVHWCTQQLRKTFGLDVS 

EEUQYVLSIESAEEIREYVTDLLQGNEGKKGQ 

FIEELITKWQKNDQELISDPLQQCFKKDEILDG 

QKSGDHLKRGRKKGRNRQEVPAFTEPDTTAE 

VKTPFDLAKAQENSNSVKKKTKFVNLYTREG 

QDRLAVLLPGRHPCDCLGQ1CHKLINNCLICG 

RIVCEQEGSGPCLFCGTLVCTHEEQDILRGDS 

N\KSQKLLKKLMSGVENSGKVDISTKDLLPH 

QELRIKSGLEKAIKHKDKLLEFDRTSIRRTQVI 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteine, 
D=Aspartic Acid, £=Glutamic Acid, 
F=Phenyl alanine, G=Glycine, H=Histidine 3 
I=Isoleucine 3 K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














DDESDYFASDSNQWLSKLERETLQKREEELR 

ELRI IA S RLSKK VTIDF AGRKILEEENSL AE Yli 

SRLDETIQAIANGTLNQPLTKLDRSSEEPLGVL 

VNPNMYQSPPQWVDHTGAASQKKAFRSSGF 

GLEFNSFQHQLRIQDQEFQEGFDGGWCLSVH 

QPWASLLWGIKRVEGRSWYTPHRGRLWIAA 

TAKKPSPQEVSELQATYRLLRGKDVEFPNDY 

PSGCLLGCVDLIDCLSQKQFKEQFPDISQESDS 

PFVFICKNPQEMVVKFPIKGNPKIWKLDSKIH 

QGAKKGLMKQNKAV 


442 


1792 


A 


3576 


1 


2019 


MPRSHTGERLCEGKEGSQCAENFSPNLSVTK 

KTAGVKPYECTICGKAFMRLSSLTRHMRSHT 

AIRA1\EKPYKCICEC\GRAFSLSQILSKMIERSH 

TGEKPYKCKQCGKTFIYHQPFQRHERTHIGEK 

PYECKQCGKALSCSSSLRVHERIHTGEKPYEC 

KQCGKAFSCSSSIRVHERTHTGEKPYACK\EC 

GKAFISVTTSVLTHMITHNGDRPYKCKECGKA 

FIFPSFLRVHERIHTGEKPYKCKQCGKAFRWS 

TSIQIHERIHTGEKPYKCKECGKSFSARPAFRV 

HVRVHTGEKPYKCKECGKAFSRISYFRIHERT 

HTGEKPYECKKCGKTFNYPLDLKIHKRNHTG 

EKPYECKECAJCTFISLENFRRHMITHTGDGPY 

KCRDCGKVFIFPSALRTHERTHTGEKPYECKQ 

CGKAFSCSSYIRIHKRTHTGEKVPYECKECGK 

AFIYPTSFQGHMRMHTGEKPYKCKECGKAFS 

LHSSFR\RHTRIHNYEKPLEC*Q\CGKAFSVSTS 

LKKPMRNAQSDRKLY/KCEK*EICVFNSNRCF 

QSCENSH*REKSCQCK*YRKRDTR*FMYSQV 

PHNHVSVSNGPYR/CGSPrRLYNT*NISrNRNL 

VAVVTP*CSTLFKCLWCWCKRAALSW*/IVQ 

DSGRGRWLTPVIPALWEAKAGGSRGQEIKTIL 

ANTVKPHLY 


443 


1793 


A 


3578 


287 


114 


DF YERKFEQFIEGHKQI VNK WRDLLCS WKRK 
LSIIKKSVLQNNL*FSAASMRFQKVFF 


444 


1794 


A 


3582 


3335 


1909 


HLFFSLFLAAMAMTGSTPCS SMSNHTKERVT 

MTKVTLENFYSNLIAQHEEREMRQKKLEKV 

MEEEGLKDEEKRLRRSAHARKETEFLRLKRT 

RLGLEDFESLKVIGRGAFGEVRLVQKKDTGH 

VY AMKILRKADMLEKEQVGHERAERDIL VEA 

DSLWVV1CMFYSFQDKLNLYLIMEFLPGGDM 

MTLLMKKDTLTEEETQFYIAETVLAIDSIHQL 

GFIHRDIKPDNLLLDSKGHVKLSDFGLCTGLK 

KAHRTEFYRNLNHSLPSDFTFQNMNSKRKAE 

TWKRNRRQLAFSTVGTPDYIAPEVFMQTGYN 

KLCDWWSLGVIMYEMLIGYPPFCSETPQETY 

KKVMNWKETLTFPPEVPISEKAKDL1LKJCCE 

WEHRIGAPGVEEIKSNSFFEGVDWEHIRERPA 

AISIEIKSIDDTSNFDEFPESDILKPTVATSNHPE 

TDYKNKDWVFINYTYKRFEGLTARGAIPSYM 

KAAK 


445 


1795 


A 


3584 


! 


6169 


RTRGIEKRFAYSFLQQL1RYVDEAHQY1LEFD 

GGSRGKGEHFPYEQEIKFFAKVVLPLIDQYFK 

NHRLYFLSAASRPLCSGGHASNKEKEMVTSL 

FCKLGVLVRFIRISLFGNDATSIVNCLHILGQT 

LDARTVMKTGLESVKSALRAFLDNAAEDLE 

KTMENLKQGQFTHTRNQPKGVTQIINYTT\'A 

LLPMLSSLFEHIGQHQFGEDLILEDVQVSCYRI 

LTSLYALGTSKSIYVERQRSALGECLAAFAGA 

FPVAFLETHLDKHNIYSIYNTKSSRERAALSLP 

TNVEDVCPNIPSLEKLMEEIVELAESGIRYTQ 
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of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
IJ— Aspartic acig, e>— oiuicinuc /\i-iu, 
F^Phenylalanine, G=Glycine, H=Histidine, 
Wsolcucinc. KHLysinc, L=Lcucinc, 
M=Methionine, N=Asparagine, FHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan. 
Y=Tyrosine, X=Unknown 5 *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














MPHVMEVILPMLCSYMSRWWEHGPENNPER 

AEMCCTALNSRHMKTLI.GNILKinWILGIDE . 

GAWMKRLAVFSQPriNKVKPQLLKTHFLPLM 

EKLKKKAATVVSEEDHLKAEARGDMSEAEL 

LILDEFTTLARDLYAFYPLLIRFGDYNRAKWL 

KEPNPEAEELFRMVAEVFIYWSKSHNFKREE 

QNFWQNEINNMSFLITDTKSKMSKAAVSDQ 

ERKJOMUCRKGDRYSMQTSLIVAALKRLLPIGL 

NICAPGDQELIALAKNRFSLKDTEDEVRDI1RS 

NIHLQGKLEDPAIRWQMALYKDLPNRTDDTS 

DPEKTVERVLD1AN VLFHLEQKSKRVGRRI IY 

CLVEHPQRSKKAVWHKLLSKQRKRAVVACF 

RMAPLYHLPRHRAVNLFLQGYEKSWIETEEH 

YFEDKLIEDLAKPGAEPPEEDEGTKRVDPLHQ 

L1LLFSRTALTEKCKLEEDFLYMAYADIN4AKS 

CHDEEDDDGEEEVKSFEEKEMEKQKLLYQQ 

ARLHDRGAAEMVLQT1SASKGETGPMVAAT 

LKLGIAILNGGNSTVQQKMLDYLKEKKDVGF 

FQSLAGLMQSCSVLDLNAFERQNKAEGLGM 

VTEEGSGEICVLQDDEFTCDLFRFLQLLCEGH 

NSDFQNYLRTQTGNNTTVNIILSTVDYLLRVQ 

ESISDFYWYYSGKDVIDEQGQRNFSKAIQVA 

KQVFNTLTEYIQGPCTGNQQSLAHSRLWDAV 

VGFLHVFAHMQMKLSQDSSQIELLKELMDLQ 

KDMVVMLLSMLEGNVVNGTIGKQMVDMLV 

ESSNNVEMILKFFDMFLKLKDLTSSDTFKEYD 

PDGKGVIFKRDFHKAMESHKHYTQSETEFLL 

SCAETDENETLDYEEFVKRFHEPAKDIGFNVA 

VLLTNLSEHMPNDTRLQTFLELAESVLNYFQP 

FLGRIEIMGSAKR1ERVYFEISESSRTQWEKPQ 

VKESKRQFIFDVVNEGGEKEKMELFVNFCED 

TIFEMQL AAQI SES DUSTERS ANKEESEKERPEE 

QGPRMAFFSILTVRSAJLFALRYNILTLMRMLS 

LKSLKKQMKXVKKMTVKDMVTAFFSSYWSI 

FMTLLHFVASVFRGFFRIICSLLLGGSLVEGA 

KKIKVAELLANMPDPTQDEVRGDGEEGERKP 

LEAALPSEDLTDLKELTEESDLLSDIFGLDLKR 

EGGQYKLIPHNPNAGLSDLMSNPVPMPEVQE 

KFQEQKAKJEEEKEEBCEETKSEPEKAEGEDGE 

KEEKAKEDKGKQKLRQLHTHRYGEPEVPESA 

FWKKIIAYQQKLLNYFARNFYNMRMLALFV 

AFAINFILLF YK VSTS S VVEG KELPTRS S SENA 

KVTSLDSSSHRIIAVHYVLEESSGYMEPTVRIL 

PILHTVISFFCIIGYYCLKVPLVIFKREKEVARK 

LEFDGLYITEQPSEDDUCGQWDRLV1NTQSFP 

NN Y WDKFVKRKV MDKYGEFYGRDRI SELLG 

MDKAAJLDFSDAREKKKPKKDSSLSAVLNSID 

\rwr\\ji\\rf<n H. WPTTYN'sFT YT AWYMTMSV1, 
Vlv I yjivy W 1SX#0 V Vrl L/l\jrL I l^rv vv i ivi. x lmij v i-i 

GHYWNFFFAAHLLDIAMGFKTLRTILSSVTH 

NGKQLVLTVGLLAV V VYLYTW AFNFFRKF 

YNKSEDGDTPDMKCDDMLTCYMFHMYVGV 

RAGGGIGDEIEDPAGDEYEIYRIEFDITFFFFVI 

VILLAIIQGLIIDAFGELRDQQEQVKEDMETKC 

FICGIGNDYFDTVPHGFETHTLQEHNLANYLF 

FLMYLINKDETEHTGQESYVWKMYQERCWE 

FFPAGDCFRKQYEDQLN 


446 


1796 


A 


3592 


1 


355 


" AGLELLNSDDPPALASQSAG1TGVTRTPSLFF* 
DTVLLCCSGWSAVAPSRLTAALFS*AQAVCL 
SLPRSWDYRRW/PPHPANFCIFCRDE/SLA/ML 
PRLVSNSWTQAILLPRPPKMLGLQV 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A— Alanine C-Cysteine, 
D=Aspartic Acid, E^GJutamic Acid, 
F=PhenyJalanine, G=Glycine, H^Hisridine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline. 
Q=Glutamine, R=Argininc, S-Scrinc, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X^Unknown, +=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


447 


1797 


A 


3598 


1202 


1070 


LFVGGGPICPEGASGFAPGPAPAPRVGVDAEV 
GR*V*GAAASQGA/GSLRPRPTGPGHPGAWL 
QV WG AAA VC AGP AM * /A VRAKRGPRAG* EP 
NSPWRSGVLAA\RAVGAGPWP*P*PGCS*ARG 
PSSRSAPGLASGPAAPLLQGVHSSAGPLLCYI 
NGTLALGLKP* * AWG WGEWRPKG 


448 


1798 


A 


3604 


3115 


557 


FRRKGGGGPKDFGAGLKYNSRHEKVNGLEE 

GVEFLPVNNVKK\^EKHGPGRWVVLAAVLIG 

LI.J.YLLGIGFLVWHLQYRDVRVQKVFNGYM 

RITNEMFVDAYENSNSTEKVSLASKVKDAL1CL 

LYSGVPFLGPYHKESAVTAFSEGSVIAYYWSE 

FSIPQHLVEEAERVMAEERVVMLPPRARSLKS 

FWTSWAFPTDSKTVQRTQDNSCSFGLHAR 

G VELMRFTTPGFPDSP YP AH ARCQ W ALRG D 

ADSVLSLTFRSFDLASCDERGRHLV\TVYNT\L 

SPMEPHA\LVQLCGTYPPSYNLTFHS\S\QNVL 

LITLITNTERRHPG\FEATFFQLPRMSSCGGRL 

RKAQGTFNSPYYPGHYPPNIDCTWNIEVPNN 

QI-IVK VRFKFF YLLEPG VPAGTCPKD Y VEING 

EKYCGERSQFVVTSNSNKITVRFHSDQSYTDT 

GFLAEYLSYDSSDPCPGQFTCRTGRCIRKELR 

CDGWADCTDHSDELNCSCDAGHQFTCKNKF 

CKPLFWVCDSLNDCGDNSDEQGCSCP\AQTF 

RCSNGKCLSK.SQQCNGKDDCGDGSDEASCP 

KVNVVTCTKHTYRCLNGLCLSKGNPECDGK 

EDCSDGSDEKDCDCGLRSFTRQARWGGTD 

ADEGEWPWQVSLHALGQGHICGASLISPNWL 

VSAAHCYIDDRGFRYSDPTQWTAFLGLHDQS 

QRSAPGVQERRLKRIISHPFFNDFTFDYDIALL 

ELEI<CPAEYSSMVRPICLPDASHVFPAGI<AIWV 

TGWGHTQYGGTGALILQKGE1RVJNQTTCEN 

LLPQQITPRMMCVGFLSGGVDSCQGDSGGPL 

SSVEADGRJFQAGVVSWGDGCAQRNKPGVY 

TRLPLFRDWIKENTGV 


449 


1799 


A 


361S 


2 


613 


F VSG SP WRMDGSTERLE ARRP AGRLP WSSRQ 

EMTRRPSLMAGRQHG WS AQQ S AT V ANP VPG 

ANPDLLPHFLGEPED V YI VKNKPVLL VCKA V 

PATQIFFKCNGEWVRQVDHVIERSTDGSSGLP 

TMEVRJNVSRQQVEKVFGLEEYWCQCVAWS 

SSGTTKSQKAY1RIAYLRKNFEQEPLAKEVSL 

EQGIVLPCRPPEGIPPAE 


450 


1800 


A 


3620 


1 


2676 


MEPSLGQGMDLTCPFGVSPACGAQASWSIFG 

ADAAEVPGTRGHSQQEAAMPHIPEDEEPPGE 

PQAAQSPAGQQGPPTAGVSCSPTPTIVLTGDA 

TSPEGETDKNLANRVHSPHKRLSHRHLKVST 

ASLTSVDPAGHIIDLVNDQLPDISISEEDKKKN 

LALLEEAKLVSERFLTRRGRKSRSSPGDSPSA 

VSPNLSPSASPTSSRSNSLTVPTPPEGDEADVS 

SPHPGEPNVPKGLADRKQNDQRKVSQGRLAP 

RPPPVEKSKE1AIEQKENFDPLQYPETTPKGLA 

PVTNSSGKMALNSPQPGPVESELGKQLLKTG 

WEGSPLPRSPTQDAAGVGPPASQGRGPAGEP 

MGPEAGSKAELPPTVSRPPLLRGLS WDSGPEE 

PGPRLQKVLAKLPLAEEEKRFAGKAGGKLAK 

APGLKDFQIQVQPVRMQKLTKLREEHILMRN 

QNLVGLKLPDLSEAAEQEKGLPSELSPAIEEE 

ESKSGLDVMPNJ SD VLLRKLRVHRSLPGSAPP 

LTEKEVENVFVQLSSAFRNDSYTLESRJNQAE 

RERNLTEENTEKELENFKASITSSASLWHI ICE 

HRETYQKLLEDIAVLHRLAARLSSRAEVVGA 



188 



5 DOC ID: <WO 0157188A2_I_> 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A-Alanine OCysteine, 
o=A<:nartir Arid R— Glutamic Acid, 
^Phenylalanine, G=Glycine, H=Histidine 3 
I=Isoleucine, K=Lysine 3 L=Leucine, 
M=Methionine, N=Asparagine, P-Proline, 
Q^Glutamine, R=Argininc, S=Serine, 
T^Thieonine, V=Valine, W-Tryptophan : 
Y=Tyrosine, X^Unknown. *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion _j 














VRQEKRMSKATEVMMQYVENLKRTYEKDH 

AELMEFKKLANQNSSRSCGPSEDGVLRTARS 

MSLTLGKNMPRRRVSVAWPKFNALNLPGQ 

TPSSSSIPSLPALSESPNGKGSLPVTSALPALLE 

NGKTN GDPDCEAS APALTLSCLEEL SQETKA 

RMEEEAYSKGFQEGLKKTKELQDLKEEEEEQ 

KSESPEEPEEVEETEEEEKDPRSSKLEELVHFL 

QVMYPKLCQHWQVIWMMAAVMLVLTVVL 

GLYNSYNSCAEQADGPLGRSTCSAAQKDSW 

WSSGLQHEQPTEQ 


451 


1801 


A 


3623 


504 


198 


" QLIQHQTVHTGRKLYECKECGKAFNQGSIL1 
RHQRIHTGEKPYECKVCGKAFRVSSQLKQHQ 
RIHTGERPYQCKELKGRGAEMLAVLAVKEQ 
NRTPVNYGK 


452 


1802 


A 


3628 


2 


195 


" MTCLHSAKAFHY* SSCSFSCEEGFALIGPE V V 
QCTALGVWTAPAPVC1AVQCQHLEALNEGT 
MG*DYPFTAFAYGSSCKYECHTVYRVRGLD 
MLHSRGCYL WNGHFTT* EAISCEPLERPCH* S 
V*CSFSCEEGFALIGPEWQCTALGVWTAPAP 
VCIAVQCQHLEALNEGTMG 


453 


1803 


A 


3637 


662 


142 


IQAKGLGIWHVPNKSPMQHWR\KGSLLRYRT 
DTHFI OTLGHNLLGIYQKYPVKYGEGKCWT 
DNGPVIPVVYDFGDAQKTASYYSPYGQREFT 
AGF VQFRVFNNERAAN ALC AGMRVTG CNTE 
HHCIGGGGYFPEASPQQCGDFSGFDWSGYGT 
\HVGYSSSREITE\AAVLLFYR 


454 


1804 


A 


3641 


1 




TQVHPAMLGLDELGRSGCGHCTQADLRFGD 
AAGRDPGQDNDRNTAEPAFPPPPRVMAAAA 
ALRAPAQSSVTFEDVAVNFSLEEWSLLNEAQ 
GCL YHD VMLETLTLI SSLGK VLILNCDL S 


455 


1805 


A 


3646 


2 


414 


AAAGRGASGALTGEGGGEQGRRVGLGSRAH 

SLLLGPTFNSCQVSSQPPRVAGLGLPLKHEPS 

RPQPPSPRGPRTVRAGVPGAHPQDTPCPEFVR 

PRKVPLVGEAPGLPPEERSRGWRRDTPGLQE 

SRVRAPSYDDIT 


456 


1806 


A 


3656 


396 


S 


' " QIVSFNSYLTLYTKNNLKSMKDLNVNTEMIK 
LLELKNIHNLG*AKFFLN*IQKALIKRKILIHW 
P/LIKIK/SFCSLSDTIKKlVIKilQTIVWEQTFIIHI 
SVKELVSRIYEAFLQFNKTVNRPVFDIKKEQK 
F 


457 


1807 


A 


3660 


14 


1961 


SE AKLGGPTGMDLWQLLLTLALAGSSDAFSG 

SE ATAAILSRAP WSLQS VNPGLKTNS SKEPKF 

TKCRS PERETFSCH WTDE VHHGTKNLGPIQLF 

YTRKNTQEWTQEWKECPDYVSAGENSCYFN 

SSFTSIWIPYCIKLTSNGGTVDEKCFSVDEIVQ 

PDPPLALNWLLNVSLTGIHADIQVRWEAPRN 

ADIQKGWMVLEYELQYKEVNETKWKMMDP 

ILTTSVPVYSLKVDKEYEVRVRSKQRNSGNY 

GEFSEVLYVTLPQMSQFTCEEDFYFPWLLIIIF 

GIFGLT VMLF VFLF SKQQRIKMLILPPVP VPKI 

KGIDPDLLKEGKLEEVNTILA1HDSYICPEFHS 

DDSWVEFIELDIDEPDEKTEESDTDRLLSSDH 

EKLHINLGVKDGDSGRTSCCEPDILETDFNAH 

D1HEGTSEVAQPQ1U.KGEADLLCLDQKNQNN 

SPYHDACPATQQPSVIQAEKNKPQPLPTEGAE 

STHQAAHIQLSNPSSLSNIDFYAQVSDITPAGS 

VVXSPGQKNKAGMSQCDMHPEMVSLCQENF 

LMDNAYFCEADAKXCIPVAPHIKVESHIQPXS 

LNQEDTYITTESLT\TAAGSP\GTGEHVPGSEM 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
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SEQ ID 
NO: of 
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seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=GJycine, H=Histidine, 
l=Isoleucine, K=Lysine. L=Leucine, 
M=Methionine J N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T=Thrconinc, V=V aline, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














PVPDYTSIHIVQSPQGLILNATALPLPDKEFLS 
SCGYVSTDQLNKIMP 


458 


1808 


A 


3663 


154 


462 


TRAPASGRSGAGLALSANAPDSGGHPGATEG 
PAG SLAHASGSARGTWRVRGRGSHG WERT V 
GAGGCANPVPALHSCASAPRGTGRVSALGPK 
TGSSPLSSPKG 


459 


1809 


A 


3664 


902 


135 


LGKYNTSMALFDFVLHNSTGEIRYITEDDVIQ 

SQNALGKYNTSMALFESNSFEKTILESPYYVD 

LNQTLFVQVSLHTSDPNLVVFLDTCRASPTSD 

FASPTYDLIICSGCSRDETCKWYPLFGHYGRF 

QFNAFKFLRSMSSVYLQCKVLICDSSDHQSRC 

\NQGCVSRSI<JU)ISSYKWKTDSIIGPIRLKRDR 

SA\NGNSGFQHETHAEETPNQPFNSVHLFSFM 

VLALNVVTVATITVRHFVNQRADYQXYQKLQ 

NY 


460 


1810 


A 


3670 


850 


557 


LGILMSPQVEAGEI*ALLTPPPGCMQFSPLTL/P 
K* WVSPGLTP/PPPE VPS VFL VEPGLPHAGQA 
GLDLUTSGDPPASTSQSARTTDVSHRAQPLAI 
S 


461 


1811 


A 


3671 


2472 


2099 


IGVLAFETGSCSVTRLYCIGIIMPHCSLDLAGSX 
TSAFR1AGTTSVHHHPQLTFFFFWIETGSHCV 
VQTGL*LLALSNPPALASQ1AGISGMSHRAWP 
GLVLYSLEFSLLCASQSLIMLFTCYNE 


462 


1812 


A 


3672 


394 


110 


VKPVNGESKRD* GADTQTCEGEADEQLQTXN 
CYYD/STKSFFYISCG*K\RKPTWAENRRLNA 
KMFGIPLHS N SDP WG YEERE V iGFHRSR VSRG 
HGS 


463 


1813 


A 


3673 


348 


1 


QRNPFSAGI IPQRPPTSGSQSELLAQPRLRPGR 

KSSFSRDQDVW * SQAVPKRQ* QRNPFS AGHP 

QRPPTSGSQSELLAQPRLRPGRKSSFSRDQDV 

WPGQKPRPSQQQHQMCASPTLGQRSPFALEP 

VPAYHGGRDPFASARPSPVG1PKPRAAPAGG 

GWRRIRPKSSTK 


464 


1814 


A 


3676 


2253 


320 


PVIQRCSQPYGFSLLISFFLKCVSETSQQPPSR 

KVFQLLPSFPTLTRSKSHESQLGNRIDDVSSM 

RFDLSHGSPQMVRRD1GLSVTHRFSTKSWLS 

QVCHVCQKSMIFGVKCKHCRLKCHNKCTKE 

APACRISFLPLTRLRRTESVPSDINNPVDRAAE 

PHFGTLPKALTKKEHPPAMNHLDSSSNPSSTT 

FSTPSSPAPFPTSSNPSSATTPPVNPSP\GQR\DSR 

FNFPSC/AYFIHHR\Q\QFIFPDISAFAHAAPLPE 

AA DGTRLDDQPKAD VLE AHE AE AEEPEAGK 

SEAEDDEDEVDDLPSSRRPWRGPISRKASQTS 

VYLQEWDIPFEQVELGEP1GQGRWGRVHRGR 

WHGEVAIRLLEMDGHNQDHLKLFKKEVMN 

YRQTRHENVVLFMGACMNPPHLAIITSFCKG 

RTLHSFVRDPKTSLDINKTRQIAQEIIKGMGY 

LHAKGIVHKDLKSRNVFYDNGXKVVITDFGLF 

\GISGVVP\EGRRENQLKLSHDWLCYLAPEIVR 

EMTPGKJDEDQLPFSKAADVYAFGTVWYELQ 

ARDWPLKNQAAEASIWQIGSGEGMKRVLTS 

VSLGKEVSENLSACWAFDLQERPSXFSLLMD 

N4LEKLPKLNRRLSHPGHF*KSADINSSKVVPR 

FERFGLGVLESSNPKM 


465 


1815 


A 


3679 


8 


803 


IPSPAWWNSTWADTFSLLLALAVALYLGYY 

WACVLQTHRAFCASNTEDLETVVNHIKHRYP 

QAPLLAVGISFGGILVLNHLAQARQAAGLVA 

ALTLSACWDSFETTRSLETPLNSLLFNQPLTA 

GI.,CQLVERLSY/R*")T.,QARTIRQFDERYTSVA 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

P\— A onnriiV A r-i rl T7— fill ttClTnic ApiH 

L>=ASpanic AciUj c— vjiuidinic aciu, 
F=PhenyIalanine 5 G-Glycine, H=Histidine, 
I=Isoleucinc, K=Lysinc. LHLeucine, 
M-=Methionine, N=Asparagine 7 P=Proline a 

{~\ — filntorninp T? A rcrintnp ^f*riHP 
V^ = OJULd.iIlUlC > ix — rvlgllltlic, u owuit, 

T=Threonine, V=V aline, W=Tryptophan : 
Y= Tyrosine, X=Unknown, *=Stop codon, 
/--possible nucleotide deletion, \=possib)e 














FGYQDCVTYYKAASPRTKIDAIRIPVLYLSAA 
DDPFSTVCALPKQAAQHSPYVALLITARGGHI 
GFLEGLLPWQHWYMSRLLHQYAKA1FQDPE 
GLPDLRALLPSEDRNS 


466 


1816 


A 


3684 


3 


307 


SSQYIVQSKTKIFL*AAREKQ/RHTCRRFSIRLS 
ANISSQTGEARGQWPSVFKVLKEKKLSTKKS 
FGQK* GRXRKTFPDKQK/LREFDTTRPTJQEML 
TGVLQG 


467 


1817 


A 


3687 


2465 


837 


ELPTPL1AAHQLYNYVADHASSYHMKPLRMA 

RPGGPEHNEYALVSAWHSSGSYLDSEGLRHQ 

DDFDVSLLVCHCAAPFEEQGEAERHVLRLQF 

FVVLTSQRELFPRLTADMRRFRKPPRLPPEPE 

APGSSAGSPGEASGLILAPGPAPLFPPLAAEVG 

MARARLAQLVRLAGGHCRRDTLWKRLFLLE 

PPGPDRLRLGGRLALAELEELLEAVHAKSIGD 

IDPQLDCFLSMTVSWYQSLIKVLLSRFPQSCR 

HFQSPDLGTQYLVVLNQKFTDCFVLVFLDSH 

LOK. 1 I V V rKhrrr V l^r l^IJoilor r /vv^i^ v o l j 

HHLES VINT ACFTL WTRLL* GSGLDH*MSLFL 
ESWAYQIACQRQD*PALLGPRASQTLSDTKG 
FVTMS*GSAAPAWQQEPPSPNTHSH*PIQDSR 
ESGQPRGPLGPFWGTPFGPPGRVSGVHTGWQ 

D1TAGHSHRDDTWVPIPALPLKHLRPPSSPFA 
LGPWVSHPLMRWVQKLSHLHSNPGTGFSMG 
GKQQRN 


468 


1818 


A 


3691 


960 


499 


QTCRKDKRAI YPHFQNE* MNEIKAI* SGTGG1 

QCFHSQNDSAFFFFLFLLETEFCSAA/TVQWH 

DFLSMQPPPPGFKQFTCLSLLSSWNYRRVPPPF 

PONF\*FLVKTGFPHVGQTGFELLTSSDLAPLA 

SQNGGITGMSPCAVVPFFFFFFFGLC 


469 


1819 


A 


3714 


4747 


495 


MAYSWQTDPNPNESHEKQYEHQEFLFVNQP 

HSSSQVSLGFDQIVDEISGKIPHYESEIDENTFF 

VPTAPKWDSTGHSLNEAHQISLNEFTSKSREL 

SWHQVSKAPA1GFSPSVLPKPQNTNKECSWG 

SPIGKHHGADDSRFSILAPSFTSLDKINLEICEL 

ENENHNYHlGFESSiPPTNSSFSSDFMPKEENK 

RSGHVNIVEPSLMLLKGSLQPGMWESTWQK 

NIESIGCSIQLVEVPQSSNTSLASFCNKVKKIR 

ERYHAADVNFNSGKIWSTTTAFPYQLFSKTK 

FNIHIF1DNSTQPLHFMPCANYLVKDUAE1LH 

FCTNDQLLPKDHILSVWGSEEFLQNDHCLGS 

HICMFQ1CDKSVIQLHLQKSREAPGKLSRICHEE 

DHSQFYLNQLLEFMHIWKVSRQCLLTLIRKY 

DFHLKYLLKTQENVYNIIEEVKiaCSVLGCVE 

TKQITDA\ /r NELSLILQRKGENFYQSSETSAKG 

UEKVTTELSTSIYQLINVYCNSFYADFQPVNV 

PRCTSYLNPGLPSHLSFTVYAAHNIPETWVHR 

INFPLEIKSLPRESMLTVKLFGIACATNNANLL 

AWTCLPLFPKEKSILGSMLFSMTLQSEPPVEM 

ITPGVWDVSQPSPVTLQ1DFPATGWEYMKPD 

SEENRSNLEEPLKECIKHIARLSQKQTPLLLSE 

EKKRYI ,WF YRF YCNNENCSLPLVLGS APGW 

DERTVSEMHTILRRWTFSQPLEALGLLTSSFP 

DQEIRKVAVQQLDNLLNDELLEYLPQLVQAV 

KFEWNLESPLVQLLLHRSLQSIQVAHRLYWL 

LKNAENEAYFKSWYQKLLAALQFCAGKALN 

DEFSKJEQKLIKILGDIGERVKSASDHQRQEVL 

KKEIGRLEEFFQDVNTCHLPLNPALCIKGIDH 

DACSYFTSNALPLKITFINANLMGKNISI1FKA 
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SEQ ID 
NO: of 
nucl- 
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seq- 
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SEQ ID 
NO: of 
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seq- 
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Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

L-J*^ I I 1 1 I 11 I 

nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 

niirlf»ntiHp 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIanine C=Cysteine, 

r"l=s A ^narrir* AriH P'stf'iliiTflmir* AcmH 

F=Phenylalanine ; G=Glycine, H^Histidine, 
I=Isoleucine, K=Lysine 3 L=Leucine, 
M=Methionine, N=Asparagine, P=Prolinc, 
Q=Glutamine s R-Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X~Unknown, *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














GDDLRQDMLVLQLIQVMDNIWLQEGLDMQ 

MHYRCLSTGKDQRLVQMVPDAVTLAKIHRH 

SGLIGPLKENTIKKWFSQHNHT.KADYEKALR 

NFFYSCAGWCWI'FILGVCDRHNDNIMLTKS 

GHMFHIDFGKFLGHAQTFG GIKRDRAPFIFTS 

EM\EYFITEGG\KNPQHFQDFV\ELCCRAYNIIR 

KHSQLLL\NLLVEMMLYAG\LPELSGI\QDLKY 

VYNNLRPQDTDLEATSHFTKKIKESLECFPVK 

LNNLIHTLAQMSAISPAXSTSQTFPQESCLLST 

TRSIFHATII OF'sK'K Yf IflVTW^XTMPTQI 
i iv o l iZj i n-ta. i ij^vjr oivJvooiNjj i l#jlv^ v x n O IN IN C 1 oL 

TEKSFEQFSKLHSQLQKQFASLTLPEFPHWW 
HLPFTNSDI IRRFRDLNHYMEQILNVSHE VTN 
SDCVLSFFLSEAGQQTVEESSPVYLGEKFPDK 

PSAHVEFYLLPYPSEVRRRKTKSVPKCTDPTY 
NEIVVYDEVTELQGHVLMLrVKSKTVFVGAI 
NIRLCSVPLDKEKWYPLGNSI1*PLLLFSSFGM 
KSLE1CDEFVGGMLLSNPIW 


470 


1820 


A 


3718 


430 


75 


VLEGASCSVVSCPFLDGVCVSQKVSV/CWQ*/ 
CPWGARAEGRLSAVVDSOISOCfvOiDI fNAV 
VLAAGSPWALCVQLLLSLGSVFLWALL 


471 


1821 


A 


3723 


891 


494 


LRQSL/NSVPQAGVQWRDSSLQAPPPRFTPLS 
CLSLPS S WDYRRLPPCLANFLYF* * RRGFTML 
ARM\ r LIS*PRDPPASASQ\STEITGGSHRAQHP 
TDSRDHSERSVKXSHEVISELRMKVIKCKVAF 
SKNP1 


472 


1822 


A 


3734 


443 


251 


GFIET*NFCVSKDTSKKLS/RLPTKWKNVFAN 
^ISDKGLVSRICQELLRHLDAEQVSSTAGLSL 


473 


1823 


A 


3746 


3 


500 


THASGGARSGAGWAGRGVRAGTEAGRGGIF 

LTLSILRTRDLPSGAMSEGVDLIDIYADEEFNQ 

DPEFNNTDQIDLYDDVLTATSQPSDDRSSSTE 

PPPPVRQEPSPKPNNKTPAILYTYSGLRNRRA 

AVYVGSFSWWTTDQQLIQVIRSIGVYDYGEV 

KFAENRAK 


474 


1824 


A 


3753 ; 


2 


5262 


RPLFAREGGIYAVLVCMQEYKTSV\LVQQAG 

LAALKMLAVASSSEIPTFVTGRDSIHSLFDAQ 

MTREIFASIDSATRPGSESLLLTVPAAVILMLN 

TEGCSSAARNGLLLLNLLLCNHHTLGDQ1ITQ 

ELRDTLFRHSGLAPRTEPMP7TRTILMMLLNR 

YSEPPGSPVERAALETPIIQGQDGSPELLIRSLV 

GGPSAEIXLDI.ERVLCREGSPGGAVRPLLKRL 

QQETQPFLLLLRTLDAPGPNKTLLLSVLRVIT 

RLLDFPEAMVLPWHEVLEPCLNCLSGPSSDSE 

IVQELTCFLHRLASMHKDYAVVLCCLGAKE1 

LSKVLDKHSAQLLLGCELRDLVTECEKYAQL 

YSNLTSSIXAGC1QMVLGQIEDHRRTHQPINIP 

FFDW1.RHLCQGSSVEVK.EDKCWEKVEVSSN 

PHRASKLTDHNPK.TYWESNGSTGSHYiTLHiVT 

HRGVLVRQLTLLVASEDSSYMPARWVFGG 

DSTSCIGTELNTVNVMPSASRVILLENLNRFW 

PIIQIRIKRCQQGGIDTRVRGVEVLGPKPTFWP 

LFREQLCRRTCLFYT1RAQAWSRDLAEDHRRL 

LQLCPRLNRVLRHEQNFADRFLPDDEAAQAL 

GKTCWEALVSPLVQNITSPDAEGVSALGWLL 

DQYLEQI^TSRNPLSRAASFASRVRRLCHLL 

VHVEPPPGPSPEPSTRPFSKNSKGRDRSPAPSP 

V1.PSSSLRNITQCWLSWQEQVSRFLAAAWR 

APDFVPRYCKL YEHL QR/ GSELFGPRAAFML 
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WO 01/57188 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine O-Cysteine, 
D=Aspartic Acid. E=01utamic Acid, 
^Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine ; L=Leucine, 
M==Methionine, N=Asparagine, P=Proline ; 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, W^Tryptophan. 
Y=Tyrosine, X=Unknown. *=Stop codon, 
/=possible nucleotide deletion, \=possib)e 
nucleotide insertion 














ALRSGFSGALLQQSFLTAAHMSEQFARYIDQ 

QIQGGLIGGAPGVEMLGQLQRHLEPIMVLSG 

LELATTFEHF YQH YMADKLL SFG S S WLEG AV 

LEQ1GLCFPNRLPQLMLQSLSTSEELQRQFHLF 

QLQRLDKLFLEQEDEEEKRL*EEEEEEEEEEA 

EKELFIEDPSPAISILVLSPRCWPVSPLCYLYHP 

RKCLPTEFCDALDRFSSFYSQSQNHPVLDMG 

PHRRLQWTWLGRAELQFGKQILHVSTVQMW 

LLLICFNQTEEVSVETLLKDSDLSPELLLQALV 

PLTSGNGPLTLHEGQDFPHGGVLRLHEPGPQ 

RSGE ALWLIPPQ A YLN VEKDEGRTLEQKRNL 

LSCLLVR1LKAHGEKGLH1DQLYCLVLEAWQ 

KGPNPPGTLGHTVAG G VACTSTD VL SCILHLL 

GQGYVKRRDDRPQILMYAAPEPMGPCRGQA 

DVPFCGSQSETSKPSPEAVATLASLQLPAGRT 

MSPQEVEGLMKQTVRQVQETLNLEPDVAQH 

LLAHSHWGAEQLLQSYSEDPEPLLLAAGLCV 

HQAQAVPVRPDHCPVCVSPLGCDDDLPSLCC 

MHYCCKSCWNEYLTTR1EQNLVLNCTCPIAD 

CPAQPTGAFIRAIVSSPEVISKYEKALLRGYVE 

SCSNLTWCTNPQGCDRILCRQGLGCGTTCSK 

CGWASCFNCSFPEAHYPASCGHMSQWVDDG 

GYYDGMSVEAQSKHLAKLISKRCPSCQAPIE 

KNEGCLHMTCAKCNHGFCWRCLKSWKPNH 

KDYYNCSAMVSKAARQEKRFQDYNERCTFH 

HQAREFAVNLRNRVSAIHEVPPPRSFTFLNDA 

CQGLEQARKVLAYACVYSFYSQDAEYMDVV 

EQQTENLELHTNALQILLEETLLRCRDLASSL 

RLLRADCLSTGMELLRRIQERLLAILQHSAQD 

FRVGT XJSPSVEA WEAKGFNMPGSQPQ ASSGP 

EAEEEEEDDEDDVPEW QQDEFDEELDNDSFS 

YDESENLDQETFFFGDEEEDEDEAYD 


475 


1825 


A 


3754 


1093 


96 


" GTSRNQHSPKTHA*RSS/WPQPPPLFLPPLQPQ 
ATGRRRRRTRTQQRTAALLTDGTTKTGAAW 
SRRPSLCWPSRTTGAPGAK+AVLVRSATPTTN 
PPNPQSPTGAAGKLRAPGNRAG/SEPSSQEPPP 
DGTR\RPAS1TGVAQSPATRATPSLPCLHVPAP 
SRGQTLGVRTTGRASRLTVDRSRLSWPGRSA 
RSGGGR\VRPNAPRGRWPRAP*S WEPGb W 1 r, 
PWRWPFPAAESPPHRCIYCTNHVSPAGPARPS 
HVYIIRATINSISHPLCRAQSSPWEAAGVWRR 
PAQPAPTSDVNlNLL,KJsJ J KV Y Kir IAjIN 

TLWEEGRQRPPETLQPAR 


476 


1826 


A 


3758 


901 


521 


" FFFGNGVSPCPQAGV*WHDLDSLQNLPPGFK 
RFS YLSLPS S W\D YRH VPPRQ ANFCIF/M * RRG 
FTMLARMVSIS*PRDLPALASQSAGITGVSHH 
APPQMDFTFALLCFAPKGCLPRQKEGGTLNLI 


477 


1827 


A 


3761 


843 


575 


" GVISAHCNLRL/CHLPGSSNSPASASQVAGTIG 
ARTTPS* IF VFLVETGFHH VSQDGLDLL/NFVI 
RPRRPLKVLGLQACTRARLPSPLKEL 


478 


1828 


A 


3763 


267 


1240 


HLLSFHLWSASLDCLEQLSQERHVKGMLLGP 

ppvtm F9TKPSPSPWKXTPPMCSIPPVFPPKSG S 

PTTS WS/PSGHSKLEVERAQTGPFCLHI YCP* P 

GVTDNTTSLLHYIPFPRL\SGLVCFPAH*FPSY 

WTGHSFASQAWLRQVPEVSKHLQCPSAESLL 

TMEYHQPEDPAPGKAGTAEAVIPENHEVLAG 

PDEHPQDTDARBADGEAREREP/RRPSFAA*P 

VWGQPVESPLPEASSAPPGPTLGTLPEVETIRA 

CSMPQELP*SPRTRQPEPDFYCVK\V1PWKGE 

QTPIITQSTNGPLPSPCHHEHPLSSVEGEAPPA 



193 
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SEQ ID 
NU: ox 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NU: Oi 
peptide 
seq- 
uence 


Met 
nod 


SEQ 

ir\ MA, 

ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G^Glycine, H=Histidine, 
I=Iso leucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, $=Serine, 
T=Threonine, V= Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknovvn, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














EGSDH1G 


479 


1829 


A 


3766 


2 


2152 


YSPIRLLEVCVPLPIGFIKRQAPLKVSLLQDLK 

DFFQKVSQVYVAIDERLASLKTDTFSKTREEK 

MEDIFAQKJEMEEGEFKNWIEKMQARLMSSS 

VDTPQQLQSWESL1AKXQSLCEVLQAWNNR 

LQDLFQQEKGTUCRPS VPPSPGRLRQGEESKIS 

AMDASPRNISPGLQNGEKEDRFLTTLSSQSST 

SSTHLQLPTPPEVMSEQSVGGPPELDTASSSE 

DVFDGHLLGSTDSQVKEKSTMKAIFANLLPG 

NSYNPIPFPFDPDKHYLMYEHERVPIAVCEICE 

PSSIIAFALSCKEYRNALEELSICATQWNSAEE 

GLPTNSTSDSRPICSSSPIRLPEMSGGQTNRTTE 

TEPQPTKKASGMLSFFRGTAGKSPDLSSQKRE 

TLRG ADS A Y YQ VGQTGKEGTENQG VEPQDE 

VDGGDTQKKQLINPHVELQFSDANAKFYCRJL 

Y YA GEFHKMRE VILDSSEEDF1RSLSHS SP WQ 

ARGGKSGAAFYATEDDRFILKQMPRLEVQSF 

LDFAPH YFN Y1TNA VQQKRPTAJLAKILGVYRI 

GYKNSQNnTEKKLDLLVMENLFYGRKNIAQ 

VFDLKGSLRNRNVKTDTGKESCDVVLLDENL 

LKMVRDNPLYIRSHSKAVLRTSIHSDSHFLSS 

HLIIDYSLLVGRDDTSNELVVGIIDYIRTFTWD 

kklemvvkstgilggqg*mptvvspelyrtr 
fceamdnyflmvpdhctglglnc 


480 


1830 


A 


3777 


251 


3 


QGCGSAGTLIHY**ECKMVQLLWKTV*QFU 
KLNI\KDPAITLDVYPNEVKNYVRTKTYTQMF 
1/ AN r IM AKb WKQPTHPS VRT 


481 


1831 


A 


3779 


333 


3 


eaairqpepnildvnqrfkdlamilhdqgdlid 
sieanaessevlverapgqlqrpayyyqkksr 
kkmclvvlvqtaiilicerim*vvyttkwsppi 
vlpvscfqgqkfn 




1 O30 

1 ojZ 


A 

A 


3 7oU 


z 


371 


tggrqgkndhtsitekpsrdfnrhlitqni*m 
pnqdmksssnsliirkvqikptilyhhiftrka 
kmkttdktkyr*gfkaittlihcsqdcki-q* s 
/l* enhfm1fpkaeqhit ydttipflr 


483 


1833 


A 


3787 


43 


448 


lmkdlspyvmethyilnrlner/rsmwrhiig 
klpnticdqekilkairgrreviqgs/rqqyrr 
paafsaaekarrlwcs/vfnierrnl/ceyptk 
lsfnikgemtfsdktefttnrpslkmllkdrl 

QEEGKMr *KEKCh KRKJE 


484 


1834 


A 


3798 


1 


727 


FFFFETESRSVAQAGVQWCNLGSLQALPPGFX 
SIISPASASRVAGTTGTRII*ARLIFYIFSRDGVS 
Ix* ruWb*brDLVIRPr\I<J^PKCWDYRR£PPRP 
A * FF VFI , VE\QGFTML ARM VSTS * PQ/CDLP A S 
VSQNAGITGVSHCAWPCLHFCFFGFFFEMESC 
o V A^A±,vi^ W rlUJLKbJLv^APFPOr IPrSCL-SLPG 
SWDYRRPPPRPANF\CIFSRDGVSPC*PGWSRS 
PDLVIRPPRPPKVLGLQA 


485 


1S35 


A 


3802 


1 


239 


FFFFEMECLTVSQAGVQWYNLHSLQPLPPGF 
KQFSC\LSLPSSWD*RVPTSRPAKF/CVIF*DGV 


486 


1836 


A 


3811 


378 


98 


RYD*SSQSENIP\QKEFLLKYP*CTATLGMRN 

MSIMKXKSIFSAEFYKVSLPSLLLXHLLAIEWG 

FHIE1QLTIHQHFLNYELESDFVHIVEYM 


487 


1837 


A 


3814 


771 


320 


FDPDWTRAAG1RHEKJCPKALAYRRENSPGDL 

PPPPLPPPEEEASWAUGAEGSRQHVLPGAGA 

QWGEESGPGRAPGSPAGAPPR*RGLAP\NSRP 

SFLSRGQGTSTCSTAGSNSSRGSSSSRGSRGPG 

RSRSRSQSRSQSQRPGQKRREEPk" 
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3DOCID: <WO 0157188A2_I_> 



WO 01/57188 



PCT/U SO 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C-Cysteine, 
D=Aspartic Acid. E=Glutamic Acid, 
F=PhenyIalanine, G^Glycine, UHHistidine, 
I=Isoleucine, K=Lysine 5 L=Leucine, 
M^Methionine, N=Asparagine, P=Proline, 
Q=Glutamine. R=Arginine, S^Serine, 
T=Threonine, V= Valine, W=Tryptophan, 
Y=Tyrosine, X=Unlcnown, *=Stop codon, 
/^possible nucleotide deletion, ^possible 
nucleotide insertion 


488 


1838 


A 


3818 


1 


781 


FRACLLEL1PYAPTLSW 1 ACJfFAMACil'KUJLJLJf 

LCLLAFCLAGFSFVRGQVLFKGCDVKTTFVT 

HVPCTSCAAIKKQTCPSGWLRELPDQITQDCR 

YEVQLGGSMVSMSGCKivXCKJsA^v vgKALLr 

GYWGSRCHECPGGAETPCNGHGTCLDGMDR 

NGTCVCQENFRGSACQECQDPNRFGPDCQSV 

CSCVHGVCNHGPRGDGSCLCFAGYTGPHCD 

QELPVWQELGFPQNNPRLRKAPNCKCLPG*H 

RNGLIATPNPCRP 


489 


1839 


A 


3822 


934 


669 


FFFSEMESRSVTRLECSGAISAHLRLLGSSNSP 
ASAS*VAGTIGACHHAQLIFVFLVETGFHHVG 
QDGLDLL/NLMIHPPRPPKVLGFQA 


490 


1840 


A 


3825 


79 


9748 
1 


GCQSCWPAWPRLRRRGPASAGARLGRKAPW 
GLPGRVQDGRPLRFCFYLRPRAPFIAPVLSGA 
ASRPEASGDCRAGRETAMATLEKLMKAFESL 

KSFQQQQQQQQQQQQQQQQQQQQQQQ pppp 

PPPPPPPQLPQPPPQAQPLLPQPQPPPPPPPPPP 

GP A V AEEPLHRPKKEL S ATKKDR VNHCLTIC 

ENIVAQSVRNSPEFQKLLGIAMELFLLCSDDA 

ESDVRMVADECLNKVIKALMDSNLPRLQLEL 

YKEIKKNGAPRSLRAALWRFAELAHLVRPQK 

CRPYLVNLLPCLTRTSKRPEESVQETLAAAVP 

KIMASFGNFANDNEIKVLLKAFIANLKSSSPT1 

RRTAAGSAVSICQHSRRTQYFYSWLLNVLLG 

LLVPVEDEHSTLLILGVLLTLRYLVPLLQQQV 

KDTSLKGSFGVTRKEMEVSPSAEQLVQVYEL 

TLHHTQHQDHNVVTGALELLQQLFRTPPPEL 

LQTLTAVGGIGQLTAAKEESGGRSRSGSIVELI 

AGGGSSCSPVLSRKQKGKVLLGEEEALEDDS 

ESRSDVSSSALTASVKDE1SGELAASSGVSTPG 

SAGHDHTEQPRSQHTLQADSVDLASCDLTSS 

ATDGDEEDILSHSSSQVSAVPSDPAMDLNDG 

TQASSPISDSSQTTTEGPDSAVTPSDSSEIVLD 

GTDNQYLGLQIGQPQDEDEEATGILPDEASEA 

FRNSS IVLALQQ AHLLKNMSHCRQPSDSS VDKF 

VLRDEATEPGDQENKPCR1KGDIGQSTDDDS 

APLVHCVRLLSASFLLTGGKNVLVPDRDVRV 

SVKALALSCVGA.A.VALHPESFFSKLYKVPLD 

TTEYPEEQYVSDILNYIDHGDPQVRGATAILC 

GTLICSILSRSRFHVGDWMGTIRTLTGNTFSL 

ADCIPI XRKTLKDESS VTCKL ACTAVRNCVM 

SLCSSSYSELGLQLI1DVLTLRNSSYWLVRTEL 

LETLAEIDFRLVSFLEAKAENLHRGAHHYTGL 

LKLQERVLNNVVIHLLGDEDPRVRIIVAAASL 

IRLVPKLFYKCDQGQADPVVAVARDQSSVYL 

KLLMHETQPPSHFSVSTITRIYRGYNLLPSITD 

VTMENNLSRVLAAlVSHELITSTTRALTFGCCE 

ALCLLSTAFPVCIW SLG WHCG VPPLS ASDESR 

KSCTVGMATMILTLLSSAWFPLDLSAHQDAL 

ILAGNLLAASAPKSLRSSWASEEEANPAATK 

QEEVWPALGDRALVPMVEQLFSHLLKVINIC 

AHVLDDVAPGPAIKAA1,PSLTNPPSLSPTRRK 

GKEKEPGEQASVPLSPKKGSEASAASRQSDTS 

GPVTTSKSSSLGSFYHLPSYLKLHDVLKATHA 

N YK.VTLDLQN STEKFGGFLRS ALD VLSQILEL 

ATLQD1GKCVEEILGYLKSCFSREPMMATVC 

VQQLLKTLFGTNLASQFDGLSSNPSKSQGRA 

QRLGSSSVRPGLYHYCFMAPYTHFTQALADA 

SLRNMVQAEQENDTSGWFDVLQKVSTQLKT 

NLTSVTKNRADKNAIHNHIRLFEPLVIKALKQ 
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MSDOCID- <WO 0157188A2_I_> 



WO 01/57188 




T/US0 1/03800 



SEQ ID 
XnU: oi 
nud- 
eotide 
seq- 
uence 


SEQ ID 
rlKJ. oi 
peptide 
seq- 
uence 


Met 

U. „ r l 

nou 


SEQ 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine. G=Glycine, H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V= Valine. W— Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














YTTTTCVQLQKQVLDLLAQLVQLRVNYCLtT" 

DSDQVFIGFVLKQFEYIEVGQFRESEAIIPNIFF 

FLVLLSYERYHSKQIIGIPKIIQLCDGIN4ASGR 

KAVTHAIPALQPIVHDLFVLRGTNKADAGKE 

LETQKEVVVSMLLRLTQYHQVLEMFILVLQQ 

CHKENEDKWKIiLSRQlADIILPMLAKQQMHI 

DSHEALGVLNTLFEILAPSSLRPVDMLLRSMF 

VTPNTMASVSTVQLWISG1LAILRVLISQSTED 

IVLSRIQELSFSPYLISCTVINRLRDGDSTSTLE 

EHSEGKQIKNLPEETFSRFLLQLVGILLEDIVT 

KQLKVEMSEQQHTFYCQEIXiTLLMCLIHIFKS 

GMFRR11AAATRLFRSDGCGGSFYTLDSLNLR 

ARSMITTHPALVLLWCQILLLVNHTDYRWW 

AEVQQTPKRHSLSSTKLLSPQMSGEEEDSDLA 

AKLGMCNREIVRRGALILFCDYVCQNLHDSE 

HLTWLIVNHIQDL1SLSHEPPVQDFISAVHRNS 

AASGLFIQAIQSRCENLSTPTMLKKTLQCLEGI 

HLSQSGAVLTLYVDRLLCl'PFRVLARMVDIL 

ACRRVEMLLAANLQSSMAQLPMEELNR1QEY 

LQSSGLAQRHQRLYSLLDRFRLSTMQDSLSPS 

PPVSSHPLDGDGHVSLETVSPDKDWYVHLVK 

SQC WTRSDSALLEGAELVNRIPAEDMNAFM 

K4NSEFNLSLLAPCLSLGMSEISGGQKSALFEA 

AREVTLARVSGTVQQLPAVHHVFQPELPAEP 

AAYWSKLNDLFGDAALYQSLPTLARALAQY 

LWVSKLPSHLHLPPEKEKDIVI^VVATLEAL 

SWIILII-IEQIPLSLDLQAGLDCCCLALQLPGL 

WSVVSSTEFVTHACSLIYCVHFILEAVAVQPG 

EQLLSPERRTNTPKAISEEEEE VDPNTQNPK YI 

TAACEMVAEM\^ESLQSVLALGHKRNSGVPA 

FLTPLLRNI1ISL ARLPL V N S YTRVPPL V WKLG 

WSPKPGGDFGTAFPEIPVEFLQEKEVFKEFIYR 

INTLGWTSRTQFEETWATLLGVLVTQPLVME 

QEESPPEEDTERTQINVLAVQAITSLVLSAMT 

VPVAGNPAVSCLEQQPRNICPLKALDTRFGRK 

LSIIRGIVEQEIQAMVSKRENIATHHLYQAWD 

PVPSLSPATIGALISHEKLLLQINPERELGSMS 

YKLGQVSIHSVWLGNSITPLREEEWDEEEEEE 

ADAPAPSSPPTSPVNSRKHRAGVDIHSCSQFL 

LELYSRWILPSSSARRTPAILISEVVRSLLVVS 

DLFTERNQFELMYVTLTELRRVHPSEDEILAQ 

YI /VTATCKAAAVJLGMDKA VAEPVSRLLESTL 

RSSHLPSRVGALHGVLYVLECDLLDDTAKQL 

IPVISDYLLSNLKGIAHCVNIHSQQHVLVMCA 

TAFYLIENYPLDVGPEFSASIIQMCGVMLSGS 

EESTPSIIYHCALRGLERLLLSEQLSRLDAESL 

VKLSVDRVNV^SPHRAMAALGLML/TCMYT 

vjrJvtKVIirOKl oDPNPAAPDSESVIVAMERVS 

VLFDRIRKGFPCEARVVARILPQFLDDFFPPQ 

DIMNKVIGEFLSNQQPYPQFMATVVYKVFQT 

LHSTGQSSMVRDWVMLSLSNFTQRAPVAMA 

TWS1 SCFFVSASTSPWVA AIT PHVISRMHK1 F 

QVDVNLFCLVATDFYRHQ1EEELDRRAFQSV 
LEWAAPGSPYHRJLLTCLRNVHKV1TC 


491 


1841 


A 


3826 


469 


302 


SNPPASASRVAGITGVHQHAWLIFVFLVEMEF 
HHVGQAVLKLLISGDLPVSASQSA 


492 


1842 


A 


3836 


392 


88 


VAPSPMIMPDLYFYRDPEEIEKEE*AAAEK\EE 
FQSEWTAVV/P/EFTATQSEVADWFKDMQVP 
SVPIQQFPTEDWS'l^PrMNDWSATSTAQTTE 
WVRITTEWP 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIamne C=Cysteme, 
LJ— Aspartic ACia, £=^iuitiiiiiL, ^\oiu, 
F=Phenylalanine ; G=Glycine, H=Histidine 3 
I=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 

(~\ — n\i il?»Tnin^ P. =: A rtnninf* S— SlpHn f 
— OlUlaUlinc, In. — /UgllllllG, o iJtiiiiv., 

T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 

niifl*»Atirlp in<;prtir»n 

1 1 lidCU IIVJU' lllowl 111/11 


493 


1843 


A 


3838 


19 


380 


TPSDMNRAFETDTQSIGEKNRSPSEPDYFERK 
KFKRS*EKAHIRYKIDQPEDIPLK\EFLCKHSK 
CTATLSMRNMSLMKKKCSFSEEFVLAFFPSLL 
VCHLLAiKLGFYIElHLTTFNNTF 


494 


1844 


A 


3845 


2 


352 


FFFLRRSL/DSVAQAEAQWL\ELGLLQAPPPGF 
KPISLP\GLPSSWDYGRPPPCPANFCIF/M*RRG 
FTVL ARMVLIS*PCDPPTL ASQGT AITGM S YH 
ARPQDIDFLYAHQGRCWFRLL 


495 


1845 


A 


3847 


1774 


40 


DIFFRRAKEGMGQDEAQFSVEMPLTGKAYL 

WADKYWRKPRFFNRVHTGFEWNK^'NQTHY 

DFDNPPPK1VQGYKFNIFYPDLIDKKSTPEYFL 

EACADNKDFAILRFHAGPPYEDIAFKIVNREW 

EYSHRHGFRCQFANGIFQLWFHFKRYRYRR* 

KPWGTAGRCPRGHSKGASVKLWTPGPLSGL 

QGRGFTSHLRPHLSFARPQFPPI*KGGHH*AC 

HGELRRHWDRL A* GPDATEGALGASFEHEG 

GQQPPADLTVQADTLHRPSARLGGAHRACPK 

RRPHRVLWRWARGAWAWRCQAREKQETQG 

y—\T*s^<r rinrr^r tt>t pde ACDA A AHA APAT AT4P. PPF 
QPCH1 1 OHrJLOK±^A±l/l A>\>Vvj/V/Vr /\l^^vri xvr r r 

ARTGSTE\PGPCWRPIRHCRRDPLWTPTLC\RD 

WPPTHPVLAGGVHFPAAG/IGGCVEVPVSVN 

VMGTKSH*AVLPPPPSTGPGGQGLPEGWGLE 

TVAPSHSPFSGQEGRGPHGCHSPGRVSGPVAGR 
LVLQHPTGTSPTEAKRKVPPGPPEGHPTSPVT 
SPRPPTAPPRHPASSGNSSVCFSICKTCRWEKK 
SFVLMELAYWQDRMFF 


496 


1846 


A 


3849 


830 


442 


Ak'QPi pt r,*Tn\VP/NT n^T KT RI PGFK*FTCLG 
LLSSWDYRSLPPRPVNFCILVELGFHHVDQAG 
LKLLTSSALPALASQSAEITGMSHR1WPLPLLR 
RPPVIRIRAPPQRLPFNLITS LKALSPNMATF 


497 


1847 


A 


3859 


2 


393 


at DYTPpnnTARTrtAOPAASWKGTNNYPWR 

LEMAGRPGSQEQSKDRGTGSLPPPSQRPLGPS 

PEGAGPSPPPPGIPRGGGSSSSEGP/PQLLFVPR 

RFPAPKKGLPSDTPHSKAPPTPHL1LGGEDSQ 

VPIL 


498 


1848 


A 


3860 


253 


634 


KNASTVYSSQGDPKSFFFLLRWSLALVAQAG 
umnni cot 0PPPPnFK"*P^PT °,T PSSWDWRCP 
LPCLANF\*FLVETGFHHVGQADLICLLTSGDP 
PTSASESAGITGVSHRAWPRIHFLYWKTFFL 


499 


1849 


A 


3863 


423 


263 


APSQISVAFLYAA/DKLFEKEI*KKIPFIIAS/DKI 
KIGINLTK-EVKYLYFENY1TLMKEIK7DTDKW 
KDILY* WIGKINI+04STPPKAI YRFN AIPTK1P 
MTFFTEIEKSIIKFI WNIiKKPPNTQ SNIEQKE* S 
FCSILLWVFGGFLWFHMNFMIDFSISVKNVIGI 
LVGIALNL 


500 


1850 


A 


3865 


2 


15246 


LPRGCLWCLQRSPTPARPQPSRl^ARSPLPLFP 

DLRPWASDLDIMGDAEGEDEVQFLRTDDEV 

VLQCSATVLKEQLKLCLAAEGFGNRLCFLEP 

TSNAQm^PPDLAICCFVLEQSLSVRALQEML 

ANTVEAGVESSQGGGHRTLLYGHAILLRHAH 

SRMYLSCLTTSRSMTDKLAFDVGLQEDAl'GE 

ACWWTMHPASKQRSEGEKVRVGDDIILVSVS 

SERYLHLSTASGELQVDASFMQTLWNMNPIC 

SRCEEGFVTGGHVLRLFIIGHMDECLTISPADS 

DDQRRLVYTEGGAVCTR^RSLWRLEPLRIS 

WSG SHLRWGQPLRVRH VTTGQYL AI TEDQG 

LVV\nDASKAHTKATSFCFRISKEKLDVAPKR 

DVEGMGPPEIKYGESLCFVQHVASGLWLTYA 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond] 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteine, 
D:kA cnArtie AciH F=Crlutam?c AeiH 

F=Phenylalanine, G=Glycine, H=Histidine 5 
I=Isolcucinc, ^Lysine. L=Leucine, 
M=Methionine s N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine ; V=Valine, W^Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possib!e 
nucleotide insertion 














APDPKAI-RLGVLKKJKAMLHQEGHMDDALSL 

TRCQQEESQAARM1HSTNGLYNQFIKSLDSFS 

GKJPRGSGPPAGTALP1EGVILSLQDL1IYFEPPS 

EDLQHEEKQSKLRSLRNRQSLFQEEGMLSMV 

LNCIDRLNVYTTAAHFAEFAGEEAAESWKEI 

VNLLYELLASLIRGNRSNCALFSTNLDWLVS 

KLDRLEASSGILEVLYCVLIESPEVLNIIQENHI 

K.SIISLLDKHGRNHKVLDVLCSLCVCNGVAV 

RSNQDLITENLLPGRELLLQTNLINYVTSIRPN 

IFVGRAEGTTQYSKWYFEVMVDEVTPFLTAQ 

ATHLRVGWALTEGYTPYPGAGEGWGGNGV 

GDDLYSYGFDGLHLWTGHVARPVTSPGQHL 

LAPEDVISCCLDLSVPSISFRINGCPVQGVFESF 

NLDGLFFPVVSFSAGVKVRFLLGGRHGEFKF 

LPPPGYAPCHEAVLPRERLHLEPIKEYRREGP 

RGPHLVGPSRCLSHTDFVPCPVDTVQIVLPPH 

LERIREKLAENIHELWALTRIEQGWTYGPVRD 

DNKRLHPCLVDFHSLPEPERNYNLQMSGETL 

KTLLALGCHV GM ADEKAEDNLKKTKLPKTY 

MMSNGYIvPAPLDLSHYRLTPAQTTLVDRLAE 

NGHNVWARDRVGQGWSYSAVQDIPARRNPR 

LVPYRLLDEATKRSNRDSLCQAVRTLLGYGY 

NIEPPDQEPSQVENQSRCDRVRIFRAEKSYTV 

QSGRWYFEFEAVTTGEMRVGWARPELRPDV 

ELGADELAYVFNGHRGQRWHLGSEPFGRPW 

QPGDVVGCMIDLTENTIIFTLNGEVLMSDSGS 

ETAFREiEIGDGFLPVCSLGPGQVGHLNLGQD 

VSSLRFFAICGLQEGFEPFAINMQRPVTTWFS 

KGLPQFEPVPLEHPHYEVSRVDGTVDTPPCLR 

LTHRTWGSQNSLVEMLFLRLSLPVQFHQHFR 

CTAGATPLAPPGLQPPAEDEARAAEPDPDYE 

NLRRSAGGWSEAENGKEGTAKEGAPGGTPQ 

AGGEAQPARAENEKDATTEKNICKRGFLFKA 

KKVAMMTQPPATPTLPRLPHDVVPADNRDD 

PEI ILNTTTY Y Y S VR VF AGQEPS C V W AG W VT 

PDYHQHDMSFDLSKVRVVTVTMGDEQGNV 

HSSLKCSNCYMVWGGDFVSPGQQGRISHTDL 

VIGCLVDLATGLMTFTANGKESNTFFQVEPN 

TKLFPAVFVLPTHQNVIQFELGKQKNIMPLSA 

AMFQSERICNPAPQCPPRLEMQMLMPVSWSR 

MPNHFLQVETRRAGERLGWAVQCQEPLTMM 

ALHIPEENRCMDILELSERLDLQRFHSHTLRL 

YRAVCALGNNRVAHALCSHVDQAQLLHALE 

DAHLPGPLRAGYYDLLISIHI.ESACRSRRSML 

SEYIVPL'iPETRAf TLFPPGRSTENGHPRHGLP 

GVGVTTSLRPPHHFSPPCFVAALPAAGAAEAP 

ARLSPAIPLEALRDKALRMLGEAVRDGGQHA 

RDPVGASVEFQFVPVLKLVSTLLVMGIFGDE 

DVKQILKMIEPEVFTEEEEEEDEEEEGEEEDEE 

EKEEDEEETAQEKEDEEKEEEEAAEGEKEEG 

LEEGLLQMKLPESVKJLQMCHLLEYFCDQELQ 

HRVESLAAFAERYVDKLQANQRSRYGLLIKA 

FSMTAAETARRTREFRSPPQEQINMLLQFKDG 

TDEEDCPLPEEIRQDLLDFHQDLLAHCGIQLD 

GEEEEPEEETTLGSRLMSLLEKVRLVKKKEEK 

PEEERS AEESKPRS LQEL VSHMV VRW AQEDF 

VQSPELVRAMFSLLHRQYDGLGELLRALPRA 

YTISPSSVEDTMSLLECLGQIRSLLIVQMGPQE 

ENLMIQSIGNIMNNKVFYQHPNLMRALGMI:^ 

TVMEVMVNVLGGGESKEIRFPKMVTSCCRFL 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H-Histidinc s 
I=Iso leucine, K=Lysine, L=Leucine, 
M^Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine 5 V= Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














CYFCR1SRQNQRSMFDHLSYLLENSGIGLGM 

QG STPLD V AAAS V IDNNEL AL ALQEQD LEK V 

VS YL AGCGLQSCPML V AKG YPDIG WKPCG G 

ERYLDFLRFAVFVNGESVEENANVWRLLIR 

KPECFGP ALRGEG G S GLL AAIEE AIRI SEDP AR 

DGPGIRRDRRREHFGEEPPEENRVHLGHAIMS 

FYAALIDLLGRCAPEMHLIQAGKGEALRIRA1 

LRSLVPLEDLVGHSLPLQIPTLGKDGALVQPK 

MSASFVPDHICASMVLFLDRVYGIENQDFLLH 

VLDVGFLPDMRAAASLDTATFSTTEMALAV 

NRY1XLAVLPLITKCAPLFAGTEHRAIMVDS 

MLHTVYRLSRGRSLTKAQRDV1EDCLMSLCR 

YIRPSMLQHLLRRLVFDVPILNEFAKMPLKLL 

TNHYERCWKYYCLPTGWANFGVTSEEELHL 

TRKLFWGIFDSLAHICKYDPELYRMAMPCLC 

AIAGALPPDYVDASYSSKAEKKATVDAEGNF 

DPRPVETLNVI1PEKLDSFINKFAEYTHEKWAF 

DKIQNNWSYGENIDEELKTHPMLRPYKTFSE 

KDKEIYRWPIKESLKAMIAWEWTIEKAREGE 

EEKTEKKKTAKISQSAQTYDPREGYNPQPPDL 

SAVTLSRELQAMAEQLAENYHNTWGRKKKQ 

ELEAKGGGTHPLLVPYDTLTAKEKARDREKA 

QELLKFLQMNGYAVTRGLKDMELDSSSIEKR 

FAFGFLQQLLRWMDISQEFIAHLEAVVSSGRV 

EKSPHEQEIKFFAKILLPLINQYFTNHCLYFLS 

TPAKVLGSGGHASNKEKEMITSLFCKLAALV 

RHRVSLFGTDAPAVVNCLHILARSLDARTVM 

KSGPEIVKAGLRSFFESASEDIEKMVENLRLG 

KVSQARTQVKGVGQNLTYTTVALLPVLTTLF 

QHIAQHQFGDDVILDDVQVSCYRTLCS1YSLG 

TTKNTYVEKLRPALGECLARLAAAIVIPVAFLE 

PQLNEYNACSVYTTKSPRERAILGLPNSVEEM 

CPDIPVLERLMADIGGLAESGARYTEMPHV1E 

ITLPMLC S YLPR W WERGPE APP S ALP AG APPP 

CTAVTSDHLNSLLGNILRIIVNNLGIDEASWM 

KRL AVFAQPTVSRARPELLQSHFIPTIGRI ,RKR 

AGKVVSEEEQLALEAKAEAQEGELLVRDEFS 

VLCRDLYALYPLLIRYVDNNRAQWLTEPNPS 

AEELFRMVGEIFIYWSKSHNFKREEQNFVVQ 

NE1NNMSFLTADNKSKMAKAGDIQSGGSDQE 

RTKKKRRGDRYSVQTSLIVATLKKMLPIGLN 

MC APTDQDI JTL AKTR Y A 1 .KDTDEE VREFLH 

NNLHLQGKVEGSPSLRWQMALYRGVPGREE 

DADDPEKIVRRVQEVSAVLYYLDQTEHPYKS 

KKAWHKLLSKQRRRAVVACFRMTPLYNLP 

THRACNMFLESYKAAWILTEDHSFEDRMIDD 

LSKAGEQEEEEEEVEEICKPDPLHQLVLHFSRT 

ALTEKSKLDEDYLYMAYADIMAKSCHLEEG 

GENGEAEEEVEVSFEEKQMEKQRLLYQQARL 

HTRGAAEMVLQMISACKGETGAMVSSTLKL 

GISILNGGNAEVQQKMLDYLKDKKEVGFFQS 

IQALMQTCS VLDLN AFERQNKAEGLGM VNE 

DGTVINRQNGEKVMADDEFTQDLFRFLQLLC 

EGHNNDFQNYLRTQTGNTTTINIIICT\T)YLL 

RLQES1 SDFYWYYSGKDVIEEQGKRNFSKAM 

SVAKQVFNSLTEYIQGPCTGNQQSLAHSRLW 

DA VVGFLI IVFAHMMMKL AQDSSQIELLKEL 

LDLQKDMVVMLLSLLEGNVVNGMIARQMV 

DMLVESSSNVEMILKJFDMFLICLKDIVGSEAF 

QDYVTDPRGLISKKDFQKAMDSQKQFSGPEI 
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SEQ ID 

INL/. or 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

1N\J. OI 

peptide 
seq- 
uence 


Met 
noo 


SEQ 

tt-\ yjjr\- 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, b— Ulutamic Acid, 
F=Phenylaianine } G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L-Leucine, 
M=Methionine, N=Asparagine, P=Proline ; 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














QFLLSCSEADENEMIN CEEFANRFQEPARDIG 

FNVAVLLTNLSEHVPHDPRLHNFLELAESILE 

YFRPYLGRIEIMGASRRIERIYFEISETNRAQW 

EMPQVKESKRQFIFDVVNEGGEAEKMELFVS 

FCEDTIFEMQTAAQISEPEGEPETDEDEGAGA 

AEAGAEGAEEGAAGLEGTAATAAAGATARV 

VAAAGRALRGLSYRSLRRRVRRLRRLTAREA 

ATAVAALLWAAVTRAGAAGAGAAAGALGL 

LWGSLFGGGLVEGAICKVTVTELLAGMPDPT 

SDEVHGEQPAGPGGDADGEGASEGAGDAAE 

GAGDEEEAVHEAGPGGADGAVAVTDGGPFR 

PEGAGGLGDMGDTTPAEPP 1'PEGSPILKRKLG 

VDGVEEELPPEPEPEPEPELEPEKADAENGEK 

EEVPEPTPEPPKJCQAPPSPPPKKEEAGGEFWG 

ELEVQRVKFLNYLSRNFYTLRFLALFLAFAIN 

FILLFYKVSDSPPGEDDMEGSAAGDVSGAGS 

GGSSGWGLGAGEEAEGDEDENMVYYFLEES 

TGYMEPALRCLSLLHTLVAFLCUGYNCLKVP 

LVIFKREKELARKLEFDGLYITEQPEDDDVKG 

QWDRLVLNTPSFPSm^WDKFVKRKVLDKJHG 

DIYGRERIAELLGMDLATLEITAHNERKPNPP 

rubbl WLMSIlJVKYQIWKPGVIFTDNSrLYLG 

W YMVMSLLGHYNNFFFAAHLLDI AMG VKTL 

RTILSSVTHNGKQLVMTVGLLAVVVYLYTVV 

Ar N r r KKr Y N K bbJDbJD b PDM KCDDMMTC Y L 

FHMYVGVRAGGGIGDEIEDPAGDEYELYRYV 

FDITFFFFVIVILLA1IQGLIIDAFGELRDQQEQV 

KEDMETKCFICG1GSDYFDTTPHGFETHTLEE 

TJXJT AMVN/TPT7T \/VI nvTVrM7T r r7UT , ^TMrc V\/\\/T^ 

HJNbAfs YIVU^rbMYbtNlvLJbl bril Ol^boY V Wlv 
MYQERCWDFFPAGDCFRKQYEDQLS 


501 


1851 


A 


3869 


467 


665 


VIVAIYCQLIFDKGAKTrQ*PFQQIAL/CKRMK 
LGPCFTPCGKTNSEWIRELSVRVKTIKHLEIGV 
N 


502 


1852 


A 


3888 


1042 


724 


SGMQWRDLTPLQPLPPRFKQFSCLSLPGSWD 
YRHAP\PLLTNF\*FLVEMGFCYVGQAGRKLL 
AboJJ^oAbAbv^oAUl L urlo 1 ArCjr'rrrrLrNrbA 
GSCSVAQAGVQ 


503 


1853 


A 


3891 


1773 


1193 


EVDSQSG VQ* QAPGSLQLQTPGLK/VSCLLSR 
Y KooLrrlLAoLCY Y Y Y Y Y/ VrL'KKuLi IL 
VQGGLKLLPSSNPF ASAP* TAG 1TGMSHC AGP 
HFNF*MFRKI SCIRE* F*HTRIYDIPFL1LFFKET 
WVLLCYPG WPQIPGLKPSSCLRLLSS WDHRC 
APPCPASFFIFHVDRVSPPCPGLVSITFICMLLL 
L 


504 


1854 


B 


3896 


279 


70 


MVSKSKSILMSWHVELTFSDN4KKMPEAFRR 

TQKHTIYLIPYQVIFWSTGKDAMRSFMMPFY 

QKEYYENQ* 


505 


1855 


A 


3899 


2 


1396 


EPGVPTKKTWFDKPDFNRTNSPGFQKKVQFG 
NENTKLELRKVPPELNNI SKLNEHFS RFGTL V 
NLQVAYNGDPEGALIQFATYEEAKKAISSTEA 

V LINlNrvTllV V I W rlivCkJo 1 l^l^J_v^ 1 J or K. V lvivJr\L 

VQQPILPWKQSVKERLGPVPSSTIEPAEAQS 

ASSDLPQVLST\LLA*QKQCIIQLL/WKAAQKT 

LLVSTSAVDNNEAQKKKQEALKLQQDVRKR 

KQEILEKHIETQKMLISKLEKNKTMKSEDKAE 

TMKTLEVLTKNITKLKDEVKAASPGRCLPKS1 

FCTKTQMQKELLDTELDLYKKMQAGEEV'IEL 

RRKYTELQLEAAKRGILSSGRGRGIHSRGRGA 

VHGRGRGRGRGRGVPGHAVVDHRPRALEIS 
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SEQ ID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
MO: of 
peptide 
seq- 
uence 


Met 

hod ] 


SEQ 1 
CD NO: 1 
m 

USSN 
09/496 
914 


Predicted 
Deginning 
nucleotide 
ocation 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A-Alanine C=Cysteine, 
D=Aspartic Acid. E=Glutamic Acid, 
F=Phenylalanine, Glycine, H=~Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
[s/1— Methionine, in— A-spardginc, r numit, 
Q=G iutamine, R=Arginine, S=Serine, 
T-Threonine, V-V aline, W-Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














AFTESDREDLLPHFAQYGEIEDCQ1DDSSLHA 
VITFKTRAEAEAAAVHGARFKGQDLKLAWN 
KP VTNI S A VETEE VEPDEEEQREIII A 


506 


1856 


A 


3911 


1952 


919 


DAELSGTLSLVLTQCCKRIKDTVQKLASDHK 

DIHSSVSRVGKAIDKNFDSDISSVGIDGCWQA 

DSQRLLNEVM VfcHr r KQUML,JJ v/uie,i^ 

GLSVDPSQ1CEPFVELKRJLEALKVRVLRPALE 

WAVSNREMLIAQNSSLEFKLHRLYF1SLLMG 

GTTNQREALQYAKNFQPFALNHQKD1QVLM 

GSL V YLRQGIEN SPYVHLLDANQ W ADICDIFT 

RDACALLGLS VJfcSr Lb V iAub v /\i^r/vj^iiNirw 

A VIEQRQCTGV WNQKDELPIEVXDLG* KS AG Y 

HSIFACP1LRQQTTDNNPPMKLVCGHIISRDAL 

NKMFNGSKLKCPYCPMEOSPGDAKQIFF 


507 


1857 


A 


3936 


439 


18 


SHPFSPAPGlCJr DAr r r LrKx oK.vjL>vjrir^j i -^vvjrv 
PGSGARCHPPSTCSPS WASPG* GAKASPAJLPR 
SHGVTLLCKAQAHLCRGEDSKDASGSTSQA 
WEPG* G A WGMPRCQGPALGSCFCPPGTTVQ 
RP AK Q RD K-KJN KJtiLoK 


508 


1858 


A 


3944 


120 


412 


WCPAGTLDFPGPQEMVLLEIEVMNQLNHRNL 

]QL YAAIE1 rrlJbl VLrJVLfcA I fcL/riv w vji^tvtvt i 
TRHGASRpG V CAHSIEGGELFERIVDED YHLT 
EV 


509 


1859 


A 


3949 


31 


392 


LTKTPSPREKGRGVLS VLLMMI* KCRV IF VKIP 
MVFFLQNFC/RIILNVA\WTGD*PNTL*KEQRG 
ITFSDSKS* YKATKJKTMWYCH1CNRYID/ERN 
RTE1PEINPCICDKIIFRKLSMTTQ 


510 


1860 


— 


3954 


1013 


885 


" FSETRACCPRLEHSGRIEAHCSLNIPUSSUPP 1 
SASSVAATTG 


511 


1861 


A 


3956 


1 


1054 


' PPAWAPRSPLIWAPTSGRHPCRAALPWSTSSV 
RWQPSEKQPPPPAHRGPADSLSTAAGAAELS 
AEGAGKSRGSGEQDWVNRPKTVRDTLLALH 
QHGHSGPFESKPKJCbFAJL I A V Ai< lAKJsaus^r^ 
PEPEGEVGPPK\TTERPSRGCPHPQRGSRSP*L 
LHPLLCLRHHPLPHLIPTGPHRLKRPRM\P\SP 
MAAL1LVADNAGGSHASKDANQVHSTTRRN 
SNSPPSPSSMNQRRLGPREVGGQGAGNTGGL 
EPVHPASLPDSSLATSAPLCCTLCHERLEDTH 
FVQCPSVPSHKFCFPCSRQS1KQQGASGEVYC 
PSGEKCPLVGSNVPWAFMQGEIATILAGDVK 
VKKERDS 


512 


1862 


A 


3957 


1086 


3 


QDRARLDCSSATSAHCNLRLPGS*DSPASASK 

VAGTTDTHHHTWLILGSSVQTGFDHVGQAG 

LELLTSGDPPISASESAGIMGMSHCVWP*SWG 

LSHHMAPPQGDCjCjRAKu 1 ruriivor w inj_.J)1^ 

H*PRCQVPS*LMTQL/FWGRHQYNPTMKRGK 

LRHREACSLPLPGEGEPGLQPSS\* SQNPCSSPL 

FHHGL*AWLWCPELLLQGQARRH*RSPPS/FK 

CPATLSLTAWSQTKRLRSQFLLLPWL*RAL*H 

pp\CHWPSRRSLGDPLLPRSQG*RDGT*ASTFC 

SYF*DTESHLVAQAGVQWRDLGSLQPPCPRL 

KVRFSRLSPPSSYTHRYVPSIILAESCISSRDRIP 

PSRPDRSRNSNSLSR 


513 


1863 


A 


3961 


3038 


476 


' VALTTSMCCNKQVIV1DKIKSASIADRCGALH 
VGDHILSrDGTSMEYCTLAEATQFLANTrDQ 
VKLE1LPHHQTRLALKGPDHVKIQRSDRQLT 
WDSWAS>mSSLHTNHHYNTYHPDHCRVPAL 
TFPKAPPPNSPPALVSSSFSPTSMSAYSLSSLN 
MGTLPRSLYSTSPRGTMMRRRLKKKDFKSSL 
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Amino acid sequence (A=Alanine OCysteine, 

Pv= A snprtir* Arirl F=Cihitamic Ariri 

F=Phenylalanine, G=Glycine, H=Histidine, 
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SLAS ST VGL AGQ WHTETTE V VLTADP VTGF 

GIQLQGSVFATETLSSPPLISYIEADSPAERCG 

VLQIGDRVMAINGIPTEDSTFEEASQLLRDSSI 

TSKVTLEIEFDVAESVIPSSGTFHVKLPKKHN 

VELGITISSPSSRKPGDPLYISDIKKGSVAHRT 

GTLELGDKLLAIDNIRLDNCSMEDAVQILQQC 

EDL VKLK1RKDEDN S DEQES S G All YT VELKR 

YGGPLG\ITISGTEEP\FDL*HSSLTtCGGLAERT 

G AIH 1 GDRILVA IN S SSL KGKPLSE A IHLLQMAG 

ETVTLKIICKQTDAQSASSPKKFPISSHLSDLGD 

VEEDSSPAQKPGKLSDMYPSHGCPSVDSAVD 

SWDGSA\IDTS\YGTEGT\SFQASGY\NFNTYD 

WRSPKORGSU SPVTAKPR^OTYPHVOI SVFD 

WDRSTASGFAGAAVDSAETEQEENFWSQALE 

DLETCGQSGlLlvBLEATlMSGSTMSLNHEAPT 

PRSPAGSDRPSFQERSSSRPHYSQTTRSNTLPS 

DVGRKSVTLRKMKQEIKE1MSPTPVELHKVT 

LYKDSDMEDFGFSVADGLLEKGVYVKNIRPA 

GPGDI GGLKPYDRJ I OV^JHVRTRDFUCCJ V 

VJ1 VJ L*r J_i VJ VJ ulvl 1 1_^1VJ_/JL/ V^ V 111 1 V IV 1 X\_L/ 1 J_/ V,\_/l_/ V 

VPLIAESGNKLDLVISRNPLASQKSIDQQSLPG 
D*SEQNSAFFQQPSHGGNLE r ll\JEPTN'rL 


514 


1864 


A 


3967 


833 


800 


LEKQG V SGM ATKRL ARQLGLIRRKS1APANG 
NLGRSKSKOI FDYl rVIDFFSTCWT\IDGKHHH 
SQEIIEFPAVLLNTSTGQIDSEFQAYVQPQEHPI 
LSEFCMELTGIKQAQVDEGVPLKICLSQFCK 
W1HK1QQQKNIIFATG1SEPS/DF* SKIMCICYL 
VR*RISYTY*SKHKSKGC 


515 


1865 


A 


3969 


492 


182 


CRFWGISTHCDTCDPLSPQTTEG**EGDLWSL 
DLLG PEFL ARKPLFKTKT YQSTF* SISKNE/FTC 
PNFIIEEGTDLIF\*QVKHNPCHRLTPEEGTVQL 
NRADS 


516 


J 866 


A 


3977 


2 


1357 


KJSlLC/QK£SNYlRLKJvAKMDKSMFVKIKTLGI 

GAFGEVCLAR1CVDTKALYATKTLRKKDVLL 

RNQ VAI IVKAERDILAEADNEWWRLYYSFQ 

DKDNLYFVMDYIPGGDMMSLLIRJvfGIFPESL 

ARFYIAELTCAVESVHKMGFIHRDIKPDNILID 

RDGHIKLTDFGLCTGFRWTHDSKYYQSGDHP 

RODSMDFSNFWGDP^CRCGDRT KP1 FRRAA 

RQHQRCLAHSLVGTPNYIAPEVLLRTGYTQL 

CDWWSVG VILFEML VGQPPFLAQTPLETQM 

KVINWQTSLHIPPQAKLSPEASDLUKLCRGPE 

DRJLGKNGAJDEIICAHP1F*NQFDFSQ*PEDSRS 

.AFKQFP*NHTTPTDTSNFDP\VDPDKLWSDDN 

EEEN VNDTLN G W YKNG1CHPEHAF YEFTFRRF 

FDDNGYPYNYPKPJEYEY1NSQGSEQQSDEDD 

QNTG SEIKNRDL V Y V 


517 


1867 


A 


3980 


1358 


1022 


FFFKKFTQSLGFLLFSFSFLFSCFFFFHFVLFCY 
VFLDRVPLCHPGWSAVVQSQVTAOnILPPSWD 
*RC1vPPH/LANLCNFCRDVSFT1XPRLVLNTWA 
QAIFQPQPPKVLGLQV 


51S 


1868 


A 


3986 


974 


666 


SPEMESHPITQAGVQWHHLSSLQPLPPGFK*F 
SCFSLPE* LGYRHVPPCLANS VFS VEMGNFLH 
VGQAGLELLTSGDLPALASQSAGITG\SHRAR 
PENGFEN1F 


519 


1869 


A 


3994 


751 


126 


NQGLRHVGLCRTCLVNQMFASS1LGKSHHHS 
LtSTNQGHNALWICAAG\PLPLKAGYC\QSFSPC 
DSLKYG\SWDEKDLTWQRDTHKRSVLRWIS 
QR GK\L A VEMEEGHCU ALPLGTECLG JK\PI V 
HLFSSEMGE\NRPMVG\AJIHVYSNAALLSFTP 
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LRCLGGEKHKSGLHARPVIVPSLJELHYUMJJSl 
AU\APAm 1 T TTTT P9YYIPFC 


j 520 


1870 


A 


3999 


882 


698 


QSFRLSLLSSWDYRHM*PRLANF*T\FFCRDR/ 
SLALLPRLVSNSWPQAILPPRPPKVLGLQT 


521 


1871 


A 


4011 


1346 


1178 


FFF*ETVSCSAS*AGVRSHDNSSLQPPSFU\SSN 

ntiTc a ou\r An AT^Tl-THM A WT 1 
rr 1 aAbii VAuA I u 1 nnn/\ w L-bo v 


522 


1872 


A 


4015 


2 


377 


QGIALLTRMGES VKHVTG G YKLRTRPLEFAA 
IGDYLDTFALKLGTIDR1AQR1IKEEIEYLVELR 
EYGP VY STWS ALEGELAEPLEG V S AC1GNCST 

AJL*ELTDDM 1 EtUrL.r V iiLi ai^oivirv 


523 


1873 


A 


4018 


341 


19 


ERVIHNQ1QQAQRSPHIFNARRSS/PRPNIVELP 
KVKEVCKTSKS/GQVIYKGVSIRLRANFLAEP 
L*NRREWDEAIKVLKEKQ\FLSKMVYPANLSF 
GNEGD1TSFPAK 


524 


1874 


A 


4020 


1067 


743 


FFLRWSL/DSVAQAG VK. W uiNLUaLv Ar rrkjr 
TPFSCLSLPSSWDYRHPPPRLAN* LTNFLCF* * 
RQGFTVLARMVL1S*PHDLPASASQSAGITGL 
SHCSWPTSSILS 


525 


1875 


A 


4021 


781 


351 


QFRVIFFFLRRSHSVAQAGMQWHDHSLLQPL 
PPRLKQ/F/SHLSPPSIWDYRRVPPCLVNFSIFF 
VETGSCQPCLQLLGSSNPPASASQSAGiAGISH 
QGQPE*orL/iKr AL V I/wvL-ivc- ir^iju^jiv v i> 
NKIINRPTHPVESSF 


526 


1876 


A 


4024 


80 


341 


TPSSTSRGTEEQQSSKMAWQRREEKEHLNVR 
RSSAEDGWKADKP/VDG*TPGEDHLPTPSPFQ 
LHIHSSESQLHHSVKSPPSLSFRLM 


527 


1877 


A 


4026 


593 


230 


DFYLYPERKKRGQMMTAVSLTTRPQESVAFE 
DVAVYFTTKEWAIMGXPAERALYRDVMLEN 
YGGCGPL*CHPTSKPALVFSVLEQGKESCFSPA 
TGSSLSRNDWRAGWIGYLELRRYTYLS 


528 


1878 


A 


4028 


1160 


242 


" GTSELLCIQRWN WGP AFPPRPGLALAP'l LQLL 
VEMGSAXSWVTPARPPPHNKHLAKV AJJl Kb 
PSAGILRTPIQVESSPQPGLPAGEQLEGLKHAQ 
DSDPRSPTLGIARTPMKTSSGDPPSPLVKQLSE 
VFETEDSKSNLPPEPVLPPEAPLSSELDLPLGT 
QLS VEEQMPP WNQ1 Er r^R^ V r MvnfcAt<.^r i 
ETPVASQSSDKPSRDPETPRSS\GSMRNRWKP\ 
NSSKVLVGKSPLHPSCQDDNSPGTLTLRQGKA 
AFKPLSENVSELKVEGAMLGTGRXLLKTEGRA 
WEi^Ovt^^tJx\J-irN vrirriv v n,o j 


529 


1879 


A 


4039 


2 


366 


KDK4VLIMEMQSMITMKCPQYL*E*RKiPUl 1 R 
CW*GCGSTGILIFaWS*PL*KTI*QPR*FKQI*T 
ILTIIYSIM*EHTFHNAGV*LSDIYPRFMKGYV 
H 1 E1C- 1 iVLr 1 A v Lir V v VKi w rw^r j 


530 


1880 


A 


4057 


358 


3 


LLEVNGNTIVTVFTKAQNKKNKGSRSILFKQL 
RKYGSRINLLKSKHDKNICTENYKT*MKEIEA 
/DTDKWKDILCSWIRRIHMKDILCSWIGRTHV 


531 


1881 


A 


4061 


50 


278 


TQGTEEIYKISSCEWVQASFSTPLITLHDFK1Y 

HKATVIKMVWYWHRQ*KFSKN/RIESSEIEPH 

IYDQF1FDKGEKIIQEKGNSFFNN/MCWKNWIF 

T*KR 


532 


1882 


A 


4069 


19 


368 


NDLLENFKFWE*FKE* LENIN GTVTEKETGGV 
YKELSSPKYSGTRQFYGQTISNFPGKI1SMVY 
KLFQNTE/TEGRHPISLYEFRITLITIPNKDNIYL 
OIWMPVSLMNIVTLKCPT 


533 


1883 


A 


4076 


1 


355 


PIRKFTKV AG* KSNTPK* LAFLF1INNEQFENK1/ 
ITNI/P^IIASKRIKYSGISLTKEMKDLYTETLLR 
KIKEDTNKWKX)I/SCFWVGR/LNIVKMPK7VIC 
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Y=Tyrosine, X^Unknown. *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insenion 














ITOAIPIKJvIPMMCMAKIEKNSS 


534 


1884 


A 


4088 


3 


1931 


IIDSSTRRMESERSPLYRQLIDLGYLSSS1-IWNC 

GAPGQDTKAQSMLVEQSEKLRHLSTFSHQVL 

QTRLVDAAKALNLVHCHCLDIFINQAFDMQR 

DLQITPKRLEYTRKJCENELYESLMNIANRKQE 

EMKDMIVETLNTMKEELLDDATNMEFKDVI 

VPENGEP VGTREIKCCIRQIQELII SRLNQ A V A 

NKJLISSVDYLRESFVGTLERCLQSLEKSQDVS 

VHITSNYLKQILNAAYHVEVTFHSGSSVTRM 

LWEQIKQHQRJTWVSPPAITLEWKRKVAQEAI 

ESLSASKLAKSICSQFRTRLNSSHEAFAASLRQ 

LEAGHSGRLEKTEDLWLRVRKDHAPRLARLS 

LESRSLQDVLLHRKPKLGQELGRGQYGVVYL 

CDNWGGHFPCALKSVVPPDEKHWNDLALEF 

HYMRSLPKHERJLVDLHGSVIDYNYGGGSSIA 

VLLIMERLHRDJ.YTGLKAGLTLETRLQIAL.DV 

VEGIRFLHSQGLVHRDIKLKNVLLDKQNRAKI 

TDLGFCKPEAMMSGSIVGTPIHMAPELFTGIC 

YDNSVDVYAFGILFWYICSGSVKLPEAFERCA 

SKDHLWNNVRRGARPERLPVFDEECWQLME 

ACWDGDPLKRPLLGIVQPMLQGIMNRJLCKSV 

NSEQPNRGLDDST 


535 


1885 


A 


4090 


2 


417 


ALMPHEAN YEEIFLKTDKDMDGFESGLEVRE 
IFLKTR/GLPSTLLAHIWALCDSICDCGKLSKD 
HI- ALAFI^ITxQKXlKGIDPPL VLTPEKI SPSN R 
ASLQKVTELTRKPVCIIFKGTILWRITDSIWMK 

ttxto l^"o nin r> a 
HiN KJsJKJL W L,KA 


536 


1886 


A 


4102 


569 


829 


DHQK*KNIPCSWIGRTNIVIsJvISILPKAIYRFSAI 
PIKIPMTFFTEI*S*NVYRTTKTQE*AKAILSKK 
hl^[\Lb,b,&tl YLUrK T Y YKAV 


537 


1887 


A 


4104 


54 


281 


SIDCEHLIRRMLVLDPSKRLT1AQIKEHKWML 
IEVPVQRPVLYPQEQENEPSIGEFNEQVLRLM 
HSLGIDQQKTfE 


538 


1888 


A 


4109 


141 


314 


IRHIPLKJRSWSHLKCFYKFILTFFFAGCSQPL 
VPRENITAWMNAIGLIITALPVS 


539 


1889 


A 


4111 


268 


1 


ASRPWGHSYP*FNQQEVDTLKRPIASSEI*MM 
I*KFAT\KKSPGPYRFTAEFSHTFKEDLVPILW 
PLFPKIYREGTLPHSFYEASITL 


540 


1890 


A 


4142 


198 


2064 


PEPGAGRAATPWGPLFWRGRGSGRCEKAAE 
AALGDFLGLHRRTQQPAVDRLLSDASAQWR 
VRG HG G VRES GRAPQQPGRRRG RRPRKRPR 
GR WRREG CG AGGRG VC VAA WSQRS I AGNN 
DYRI.FHKMSNSHPLRPFTAVGETDHVHILSEH 
1GALL1GEEYGDVTFVVEKJCRFPAHRV1LAAR 
CQYFRALLYGGMRESQPEAEIPLQDTTAEAFT 
MLLKYIYTGRATLTDEKEEVLLDFLSLAHKY 
GFPELEDSTSEYLCTILNIQNVCMTFDVASLY 
SLPKLTCMCCMFMDRNAQEVLSSEGFLSLSK. . 
TALLNIVLRDSFAAPEKDIFLALLNWCKHNSK 
ENHAE1M QA VRJLPLM SLTELLN V VRPSGLL SP 

Pi ATT T\\ W VP^PQUntV/UYT MVT5 n\A1 TPT7TXH AT 

MKYGAQVVKGELKSALLDGDTQNYDLDHG 

FSRHPIDDDCRSGIEIKLGQPSIINHVRILLWDR 

DSRSYSYFIEVSMDELDWVRVIDHSQYLCRS 

WQKLYFP.ARVCRYIR1VGTHNTVNK[FHIVAF 

ECMFTNKTFTLEKGLIVPMENVATIADCASVI 

EGVSRSRNALLNGDTKNYDWDSGYTCHQLG 

SGAIVVQLAQPYMIGSIRVLLWDCDDRSY 


541 


1891 


A 


4146 


282 


778 


GTLGYPNGARGQPQDNFFAHQWSHHPPISAC 
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F=Phenylaianine, G^Glyeine, H=Histidine, 
l=lsoleucine, K=Lysine, L=I .eucine, 
M=Methionine, N=Asparagine, P=Proline ? 
Q=Glutamine, R-=Arginine s S=Serine, 
T=Threonine, V=VaIine, W=Tryptophan 5 
Y=Tyrosine, X=Unknown 5 *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














HAE SENF AF WQDMK WKNKF WGKSLEI VP V Ci 
TVNVSLPRFGDHFEWNK.V I dCli-tlN VLbO^ivvv 
IEHYGEVLIRNTQDSSCHCKITFCKAKYWSSN 
VHEVQGAVLSRSGRVLHRLFGKWHEGLYRG 
PTPGGQCIWKP 


542 

I 


1892 


A 


4147 


44 


433 


S VD A Y V CNDI VF S YRTTITLLEG A* LTHRY V A 
QDPKQGQLRSLHLTCDSAPAGSQGTWSTSCR 
INHLIFRGGAQITFLATFDDSPKAVLGDRLLLT 
ANVSSENNTPRTSKTTrQLEL^VKiJA v livv 
SSH 


543 


1893 


A 


4153 


678 


11 


TISYPQCLTQMYFL1SFANVDTFLLPIMALDH 

YVAICSALQ*CSI1TP/ELCQGLPVLA*AGSSLIS 

PVHTVIMSRLAFCSSAQISHFYRDAYLLMKIA 

CSHT*\N QHVFLG AVVLFLAPCAL1LVS YIRIA 

AAILRIPSPTRRRKA-CSICSSHLbLV 1 Lr ylj i v 

LG1CI*PPDSFSAQDAIATIMYTVVTSMLNPFIY 

SLMNKEVQEAVRRLFSRGSHSSWCW 


544 


1894 


A 


4158 


3 


538 


LL Y AQ AG VQ * LNLSSLQPQPAGLKQ S SI IP SLP 
SSWDYRYSTPHPANFFVEMEFHHVAQAGLEL 
LGSGDLPTSTSHSAGITGV\SHFLAPPRLISSbOi> 
LLGHLLCLPMVFPLLCVFVIJSSSLAGEEAAG 
LRVQKLWPAVVLSHLPVCWFHCSG1WSEVIE 
LKVGREGHVLPWQAHVVEF 


545 


1895 


A 


4160 


1 


412 


HPLGLGLVPSEIFSPQDKKAADU SlLAKAKUis 

DLEAGLKGSFMDGRLQASVSVFR1QRVGSAM 

QDTASAMPCLPYYPTSHCFMAGGKSRSQGW 

ELELSGEPAPGWQVLAGYTYTQARYLRDASE 

ANVGQPLRPVDPR 


546 


1896 


A 


4174 


1252 


1190 


FFQVFIFLFLIFFKTEFHSCCPGAVQWHDLDSL 
QPPPPRFKGFSCLSLPSSWDYRHAPAHPANFV 
FL VETGFLH V\G Q\ASLELPTSGDTP ASVASQ S A 
G ITGVSHHA * PRASGRRC W 


547 




1897 


A 


4176 


3029 


1 


AGPDGLAAPASCQGARGQTRVPGAFSWLAP 

G SHHASEGL APGVPP AGG VS AQELTAPPQEG 

WGLGAPPAAPRPESDEKRAGSDAVRSFSRGA 

RDSLGQRRLGGTRGAGPAGKGAQRTMGPAS 

GFHSFPPRPHQEPSPRSSCWQHLLWHCPWPQ 

PSRLPRLTPAQLLQGPGVLAAPPGP*HVPGFL 

AQSPWPLPSGPRSP*DPLHQGALVPLPQGGSP 

HTAPHCLPSVLSPAIQQPLLPTAST/SSRSPPAS 

TMAPIPSALAVWEPAGSSPQLSSAPADSS\PLP 

ALPKVLPPWTQKPLLGCLCQSPLPLLSPPDQI/ 

RCPPACSPAAASSFSFESQPCPSAPSKASPAPA 

ALU VGPHHPP* SQQPQSQS VHPHGPGGPQPPL 

AASSLFWMFCQPPPPHPQFL WHRPLPVTGKA 

LASVPLCFRPAPGSLRQTPLPPQFHIPRPGLSAP/ 

PPPASGTSDSSDSRSPSASAARVWPPAXSPPPP 

AARHRPHPPEYFLSPCPFSCGFPRLLGRPRRPQ 

ALQTPRAWDLPPGSSPAPLCSGPELP*APPPLP 

PFPRVA*LGSGHPPSAQVPGLW*RCV*GHPIP 

RPVGHS*SGPPHSPPL*APPQAWPLELPPSRQC 

LQPLHLRAAQPLDPCCSLSPPGPPLPVPALPS 

WPGRP*SPSPASSQPPYHAGLPGPQSSPLPPGL 

PQLPSLRSGSQQPLLFFQCPGPGAVWGKGSPQ 

PLSPHPPPP/ARTQTFPVASRSLSPGTAPYSVCL 

TPSRSASSLPEWLASSLPKJPQSSGS\PLGPTSP 

MP*CFHRPSPPLP/LSSPFPA\LRPQAPQFPLHLP 

P*PPAPSPGCPLPPLAQQHQPSPPSPHARSTLT 

PPLWPSLALi.P*PLPPPPPVPSFSASLLCSLPAH 
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D=Aspartic Acid, E=Glutamic Acid, 
F=Phcnylalaninc, G=Glycinc, H=Histidine } 
I=Isoleucine, K=Lysine, L=Leueine, 
M=Methionine, N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
J=inreonine, v= valine, w— i ryptopnan, 
Y=Tyrosine. X=Unknown s *=Stop codon, 
/^possible nucleotide deletion, V=possibIe 
nucleotide insertion 














GTPASPGLGRSCLGKPQTLPWISFWPPSGRLA 

PGTWQPW/PVSPAPLSCLSAWDPWELPSPQPQ 

VCSTAELPTSCLLSSPGP\PAFQPPRFGCL*GPP 

GPPGLPPLQSSLSFPPPPPPVPQPPAPPALQWG 

LHLPGGRTK 


54S 


1898 


A 


4180 


2369 


844 


RJHREEDFQFILKGIARLLSNPLLQTYLPNSTK 

KIQFHQELLVLFWKLCDFNKVGQPRGALQGD 

GEQLPQ*PGGRDSVRLRGVGQSCPSLELSPLG 

PSPHP*KFLFFVLKSSDVLDILVPILFFLNDAR 

ADQSRVGLMHIGVFILLLLSGECNFGVRLNKP 

YSIR VPMDI PVFTGTH ADLLI V\ VFHKIITSGHQ 

RLQPLFDCLLT1VVNVSPYLKSLSMVTANKLL 

HLLEAFSTTWFLFSAAQNHHLVFFLLEVFNNI 

1QYQFDGNSNLVYAIIRKRSIFHQLANLPTDPP 

TIHKALQRRRRTPEPLSRTGSQGGAPPWRAPA 

PLPLQSQAPSRPVWWLLQALTS'PRSPRCQR 

MAPCGPWNLSPSRAWRMAARLRGSPARHGG 

SSGDRP/HSSASGQWSPTPEWVLSWKSKLPLQ 

TIMRLLQVLVPQVEKICIDKGLTOESEILRFLQ 

HGTLVGLLPVPHPIURKYQANSGTAMWFRT 

YM W G VI YLRN VDPP V W YDTD VKLFEIQRV 


549 


1899 


A 


4191 


858 


321 


LPWQRLGVLLSRGKMAVTGWLESLRTAQKT 

ALLQDGRRK VH YLFPDGKEMAEE YDEKTSE 

LLVRiCWRVKSALGAMGQWQLEVGDPAPLG 

AGNLGPELIKESNANPIFMRKDTKMSFQWRIR 

NLPYPKDVYSVSVDQKERCIIVRTTNKKYYK 

KFSIPDLDRHQLPLDDALLSFA\TPTAP 


550 


1900 


A 


4192 


1 


1980 


IRHTGSDIAGVCGWLLLSGPCGVGLDLDSRLL 

GASAMRRSEVLAEESIVCLQKAJLNHLREIWE 

L1GIPEDQRLQRTEVVICKHIKELLDMM1AEEE 

SLKERLIKSISVCQKELNTLCSELHVEPFQEEG 

ETTIL QLEKDLRTQ VEL MRKQKKERKQE\LKL 

LQEQDQELCVEILCMPHYDIDSASVPSLEELNQ 

FRQHVTTLRETKASRREEF/VSSIKRQIILCME 

ELDHTPDTSFERDVVCEDEDAFCLSLENIATVL 

QKLLRQ\LEMQKSQNEAVCEG\LRTQI\RELW 

DRLQIPEEEREAVATIMSGSKAKVRK\ALQ\LE 

VDRLEELEKCKTMKICVIEAIRVELVQYWDQC 

FYSQEQRQAFAPFCAEDYTESLLQLHDAE1VR 

LKNYYEVHKJELFEGVQK\Vfc.ETWRLFLEFER 

KASDPNRFTNRGGNLLICEEKQRAICLQKMLP 

KLEEELKARIEL WEQEHSKAFM VNG QKFME 

YVAEQWEMHRLEICERAKQERQLKNKKQTET 

EMLYGSAPRTPSKRRGLAPNTPGICARKLNTT 

TMSNATANSSIRPIFGGTVYHSPVSRLPPSGSK 

PVAASTCSGKXTPRTGRHGANKENLELNGSI 

LSGGYPGSAPLQRNFSINSVASTYSEFADPSLS 

DSSTVGLQRELSKASKSDATSGILNSTNIQS 


551 


1901 


A 


4194 


3 


1008 


AWHEGLVSSPAIGAYLSASYGDSLVVLVATV 
VALLDICFILVAVPESLPEKMRPVSWGAQISW 

AG\QYSSFF\LYLR\QVIGFG\TVKIAAF1AMVGI 

LSIVAQTAFLSILMRSLGNKNTVLLGLGFQML 

QLAWYGFGSQAWMMWAAGTVAAMSSITFP 

AISALVSRNAESDQQGVAQGIITGIRGLCNGL 

GPALYGFIFYMFHVELTELGPKLNSNNVPLQ 

GAVIPGPPFLFGACIVLMSFLVALFIPEYSKAS 

GVQKHSNSSSGSLTNTPERGSDEDIEPLLQDS 

SIWELSSFEEPGNQCTEL 
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SEQ ID 
KO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoIeucine 9 K=Lysine, L=Leucine, 
M^Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y^Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 


552 


1902 


A 


4197 


2 


14302 


ARPPP APGSRQQKQKAAPGAAAAAELRCi AK 

EPAPARRRGTMADGGEGEDEIQFURTDDEVV 

LQCTATIHKEQQKLCLAAEGFGNRLCFLESTS 

NSKNVPPDLSICTFVLEQSLSVRALQEMLANT 

VEKSEGQVDVEKWKFMMKTAQGGGHRTLL 

YGHAILLRHSYSGMYLCCLSTSRSSTDKLAFD 

VGLQEDTTGEACWWT1HPASKQRSEGEKVR 

VGDDLILVSVSSERYLHLSYGNGSLHVDAAF 

QQTLWS VAPI SSGSEAAQG YLIGGD VLRLLH 

GHMDECLTVPSGEHGEEQRRTVHYEGGAVS 

VHARSLWRLETLRVAWSGSHIRWGQPFRLR 

HVTTGKYLSLMEDKNLLLMDKEKADVKSTA 

FTFRSSKEKLDVGVRKEVDGMGTSEIKYGDS 

VCYIQHVDTGLWLTYQSVDVKSVRMGS1QR 

KAIMHHEGHMDDGISLSRSQHEESRTARVIRS 

TVFLFNRFIRGLDALSKKAKASTVDLPIESVSL 

SLQDLIGYFHPPDEHLEHEDKQNRLRALKNR 

QNLFQEEGMTNLVLECIDRLHVYSSAAHFAD 

VAGREAGESWKSILNSLYELLAALIRGNRKN 

CAQFSGSLDWLISRLERLEASSGILEVLHCVL 

VESPEALNIIKEGHIKSI1SLLDKHGRNHKVLD 

VLCSLCVCHGVAVRSNQHLICDNLLPGRDLL 

LQTRLVNHVSSMRPNIFLGVSEGSAQYKKWY 

YELMVDHTEPFVTAEATHLRVGWASTEGYSP 

YPGGGEEWGGNGVGDDLFSYGFDGLHLWSG 

CIARTVSSPNQHLLRTDDVISCCLDLSAPSISF 

RJNGQPVQGMFENFNIDGLFFPVVSFSAGIKV 

RFLLGGRHGEFKFLPPPGYAPCYEAVLPKEKL 

KVEHSREYKQERTYTRDLLGPTVSLTQAAFT 

PIPVDTSQIVLPPHLER1REKLAENIHELWVMTM 

KIELGWQYGPVRDDNKRQHPCLVEFSKLPEQ 

ERNYNLQMSLETLKTLLALGCHVGISDEHAE 

DKVKKMKLPKNYQLTSGYKPAPMDLSFIKLT 

PSQEAMVDKLAENAHNVWARDRIRQGWTY 

GIQQDVKNRRNPRLVPYTPLDDRTKKSNKDS 

LREAVRTLLGYG YNLE APDQDHAARAEVC S 

GTGERFRIFRAEKTYAVKAGRWYFEFETVTA 

GDMRVGWSRPGCQPDQELGSDERAFAFDGF 

KAQRWHQGNEHYGRSWQAGDVVGCMVDM 

NEHTMMFTLNGEILLDDSGSELAFKDFDVGD 

GFIPVCSLGVAQVGRMNFGKDVSTLKYFTIC 

GLQEGYEPFAVNTNRDITMWLSKRLPQFLQV 

PSNHEHIEVTRIDGTIDSSPCLKVTQKSFGSQN 

SNTDIMFYRLSMPIECAEVFSKTVAGGLPGAG 

LFGPKNDLEDYDADSDFEVLMKTAHGHLVP 

DRVDKDKEATKJPEIHS1NHKDYAQEKPSRLKQ 

RFLLRRTKPDYSTSHSARLTEDVLADDRDDY 

DFLMQTSTYYYSVRIFPGQEPANVWVGWITS 

DFHQYDTGFDLDRVRTVTVTLGDEKGKVHE 

SIKRSNCYMVCAGESMSPGQGRNNNGLEIGC 

VVDAASGLLTFJANGKELSTYYQVEPSTKLFP 

AVFAOATSPNVFOFELGRIKNVMPLSAGLFKS 

EHKNPVPQCPPRLHVQFLSHVLWSRMPNQFL 

KVDVSRI SERQG WT .VQCLDPLQFMSLHIPEEN 

RSVDILELTEQEELLKF'HYHTLRLYSAVCALG 

NHRVAHALCSHVDEPQLLYAIENKYMPGLLR 

AGYYDLLIDIHLSSYATARLMMNNEYIVPMT 

EETKSITLFPDENKKHGLPGIGLSTSLRPRMQF 

SSPSFVSISNECYQYSPEFPLDILKSKTIQMLTE 

AVKEGSLHARDPVGGTTEFLFVPLIKXFYTLLI 
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SEQ ID 

INU. 01 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

fNVj. OI 

peptide 
seq- 
uence 


Met 
nou 


SEQ 
in "Mfv 

1U JN W . 

in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F-^Phenylalanine ? G=Glycine, H=Histidine, 
l=Isoleucine ? K=Lysine ; L=Leucine, 
M=Methionine, N=Asparagine ? P=Proline 3 
Q=Glutamine, R=Arginine ? S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosinc, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possib!e 
nucleotide insenion 














MGIFHNEDLKHILQLIEPSVFKEAATPEEESDT 

LEKELSVDDAKLQGAGEEEAKGGKRPKEGLL 

QMKLPEPVKLQMCLLLQYLCDCQVRHRIEA1 

VAFSDDFVAKLQDNQRFRYNEVMQALNiMSA 

ALTARKTKEFRSPPQEQINMLLNFKDDKSECP 

CPEEIRDQLLDFHEDLV1THCG1ELDEDGSLDG 

NSDLTIRGRLLSLVEKVTYLKKKQAEKPVES 

DSKKSSTLQQLISETMVRWAQESV1EDPELVR 

AMFVLLHRQYDGIGGLVRALPKTYTINOVSV 

EDTINLLASLGQIRSLLSVRMGKEEEKLMIRG 

LGDlMNNKWYQI-rPNLMRAt.GMHETVMEV 

MVNVLGGGESKEITFPKMVANCCRPLCYFCR 

1SRQNQKAMFDHLSYLLENSSVGLASPAMRG 

STPLDVAAASVMDNNELALALREPDLEKVVR 

YLAGCGLQSCQMLVSKGYPDIGWNPVEGER 

YLDFLRFAVFCNGESVEENANVVVRLLIRRPE 

CFGPALRGEGGNGLLAAMEEAIKIAEDPSRJD 

GPSPNSGSSKTLDTEEEEDDTIHMGNAIMTFY 

SALIDLLGRCAPEMHLIHAGKGEAIRIRSILRS 

LIPLGDLVGVISIAFQMPTIAKDGNVVEPDMS 

AGFCPDHKAAMVLFLDRVYGIEVQDFLLHLL 

EVGFLPDLRAAASLDTAALSATDMALALNRY 

LCTAVLPLLTRCAPLFAGTEHHASLIDSLLHT 

VYRLSKGCSLTKAQRDSIEVCLLSICGQLRPS 

MMQHLLRRLVFDVPLLNEHAICMPLKLLTNH 

YERCWKYYCLPGGWGNFGAASEEELHLSRK 

LFWG1FDALSQKKYEQELFKLALPCLSAVAG 

ALPPDYMESNYVSMMEKQSSMDSEGNFNPQ 

PVDTSNITIPEKLEYFINKYAEHSHDKWSMDK 

LANGWIYGEIYSDSSFCVQPLMKPYKLLSEICE 

KEIYRWPIKESLKTMLARTMRTERTREGDSM 

ALYNRTRRISQTSQVSVDAAHGYSPRAIDMS 

NVTLSRDLRAMAEMMAENYHNIWAKKKICM 

ELESKGGGNHPLLWYDTLTAKJSKAKDREKA 

QDILKFLQINGYAVSRGFKDLELDTPSIEKRFA 

YSFLQQLIRYVDEAHQYILEFDGGSRGKGEHF 

PYEQEIKFFAKWLPLIDQYFKNHRLYFLSAA 

SRPLCSGGHASNKEKEMVTSLFCICLGVLVRH 

RISLFGNDATSrVNCLHILGQTLDARTVMKTG 

LESVKSALRAFLDNAAEDLEKTMENLKQGQF 

THTRNQPKGVTQriNYTTVALLPMLSSLFEHI 

GQHQFGEDULEDVQVSCYRILTSLYALGTSK 

SrYVERQRSALGECLAAFAGAFPVAFLETHLD 

K^IYSITOTKSSRERAALSLPTNVEDVCPNIP 

SLEKLMEEIVELAESGIRYTQMPI TViVfE VILPM 

LCSYMSRWWEHGPENNPERAEMCCTALNSE 

HN1NTLLGNILKII YNNLG1DEGA WMKRLA VF 

SQPITNnCVKPQLLKTHFLPLN^ia.KKKAATVV 

SEEDHLKAEARGDMSEAELL1LDEFTTLARJDL 

YAFYPLLIRFVDYNRAKWLKEPNPEAEELFR 

MVAEVFIYWSKSIINFICREEQNFVVQNEINN 

1VJ.OJT J-/J 1 1 JVOI\JVJOi\A/A V OJ^V^i^I\.i\JVJvll\JVJ\.VjJ-^ 

RYSMQTSLIVAALKRLLPIG1.NICAPGDQEL1A 

LAKNRFSLKDTEDEVRDIIRSNIHLQGKLEDP 

AIRWQMALYKDLPNRTDDTSDPEKTVERVL 

DIANVLFHLEQKSKRVGRRHYCLVEHPQRSK 

KAVWHKLLSKQRKRAVVACFRJVL\PLYNLPR 

HRAVNLFLQGYEKSWIETEEHYFEDKLIEDLA 

KPGAEPPEEDEGTKRVDPLHQLILLFSRTALT 

EKCKLEEDFLYMAYADIMAKSCHDEEDDDG 
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SEQ ID J 
NO: of I 
nucl- \ 
eotide J 
seq- i 
uence 


>EQ ID I 

s(0: of 1 
peptide 
;eq- 
jence 


Viet J 
lod 


SEQ 1 
lD NO: 1 
n i 
USSN 1 
D9/496 < 
914 


Predicted 

beginning 

nucleotide 

ocation 

^orrespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteme, 

r\ a oTT^T-+i i- Ar\t\ "F=Gliitamic Acid, 

F=Phenylalanine, G=Glycine, FNHistidine, 
I=Isoleucinc, K=Lysine, L=Leucine, 
M=Methionine 1 N-Asparagine, P=Proline, 
Q=Gluiamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, V^possible 
nucleotide insertion 
















EEEVKSFEEKEMEKQKLLYQQARLHDRGAA 

EMVLQTISASKGETGPMVAATLKLGIAILNGG 

NSTVQQKMLDYLKEKKDVGFFQSLAGLMQS 

CSVLDLNAFERQNKAEGLGMVTEEGSGEKV 

LQDDEFTCDLFRFLQLLCEGHNSDFQNYLRT 

QTGNNTTVNIIISTVDYLLRVQESISDFYWYY 

S GKD V1DEQGQRNFSKAIQVAKQ VFNTLTEYI 

QGPCTGNQQSLAHSRLWDAVVGFLHVFAHM 

QMKLSQDSSQ1ELLKELMDLQKDMVVMLLS 

MLEGNWNGTIGKQMVDMLVESSNNVEMIL 

KFFDMFLKLKDLTSSDTFKEYDPDGKGV1SK 

RDFHKAMESHKHYTQSETEFLLSCAETDENE 

TLDYEEFVKRFHEPAKDIGFNVAVLLTNLSEH 

MPNDTRLQTFLELAESVLNYFQPFLGRIEIMG 

SAKR1ERVYFEJSESSRTQWEKPQVKESKRQFI 

FDVVNEGGEKEKMELFVNFCEDTIFEMQLAA 

Q1SESDLNERSANKEESEKERPEEQGPRMAFF 

SILTVRSALFALRYNILTLMRMLSLKSLKKQM 

KKVKKMTVKDMVTAFFSSYWSIFMTLLHFV 

ASVFRGFFRIICSLLLGGSLVEGAKKIKVAELL 

AN MPDPTQDEVRGDGEEGERKPLEAALPSED 

LTDLKELTEESDLLSDIFGLDLKREGGQYKLIP 

HNPNAGLSDLMSNPVPMPEVQEKFQEQKAK 

EEEKEEKEETKSEPEKAEGEDGEKEEKAKED 

r/rvnk'T cm HTHTC YGFPE VPESAFWKXI1AY 

QQKLLNYFARNFYNMRMLALFVAFAINFDLL 

FYKVSTS S VVEGKELPTRSSSEN AK VTSLDSS 

SHRHAVHYVLEESSGYMEPTVRILPILHTV1SF 

FCIIGYYCLKVPLVIFKREKEVARKLEFDGLYI 

TEQPSEDDIKGQWDRLVINTQSFPNNYWDKF 

VKRKVMDKYGEFYGRBR1SELLGMDKAALD 

FSDAREKKKPKKDSSLSAVLNSIDVKYQMW 

T r v r, \r \ /"PTT»J *nF I YLAWYMT 


553 


1903 


A 


4199 


31 


767 


LPELNGRGAGLRRAEPSERGGGAER'l gg V AA 
LPLSHGHSHGGGGCRCAAER/VGAARGSAAC 
AYGLYLRIDKGRLQCLNESREGSGRGVFKPW 
ERAD\DRSKFVESDADEELLFNIPFTG\HVKLK 
G1IIMGEDDDSHPSEMRLYKNIPQMSFDDTER 
EPDQTFSLNRDLTGELEYATKISRFSNVYHLSl 
HISKNFG ADTTKVF YIGLRGEWTELRRHEVT1 
CNYEASANPADHRVHQVTPQTHFIS 




554 


1904 


"A 


4200 


1 


961 


GIPCTEMGNFDNANVTGEIEFAIHYCFKTHSL 

E1CIKACKNLAYGEEKKKKCNPYVKTYLLPD 

x> c cnnir T? T<f Tf T VORNT VDPTFOETLKYQ V APA 

QLVTRQLQVSVWHLGTLARRVFLGEVIIPLAT 

WDFEDSTTQSFRWHPLRAKADKYEDSVPQS 

NGELTVRAKLVLPSRTRKLQEAQEGTDQPSL 

HGQLCLVVLGAKNLPVRPDGTLNSFVKGCLT 

LPDQQKLRLKSPVLRKQACPQWKHSFVFSGV 

TPAQLRQSSLELTV\VDQALFGMNDRLLGGT\ 

RLGSKGDTAVGGDACSQSKLQWQKVLSSPN 

LWTDMTLVLII 




555 


1905 


A 


4211 


331 


2419 


"KENKKARNLRMNQSRSRSDGGSEETLFQDH 
NHHENERRWQQEREHREEAYYQFINELNDE 
DYRLMRDHNLLGTPGE1TSEELQQRLDGVKE 
QLASQPDLRDGTNYRDSEVPRESSHEDSLLE 
WLNTFRRTGNATRSGQNGNQTWRAVSRTNP 
NNGEFRFSLEIHVNHENRGFEIHGEDYTDIPLS 
D SNRDHT ANRQQRSTXSP V ARRTRSQTS VNFN 
GSSSNIPRTRLASRGQNPAEGSFSTLGRLRNGT 
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SEQ ID 
Nu. of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
rvo. or 
peptide 
seq- 
uence 


Met 
noa 


SEQ 

IL/ /VL>. 

in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine ; H=Histidine, 
l=Isoleucine, K=I.ysine, L^Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutarnine, R=Arginine, S=Serine ; 
T^Threonine, V=V aline, W=Tryptophan, 
Y^Tyrosine. X=Unknown, *=Stop codon, 
/=possiblc nucleotide deletion, \=possib!e 
nucleotide insertion 














GGAAGIPRANASRTNFSSHTNQSGGSELRQRE 

GQRFGAAHVWENGARSNVTVRNTNQRLEPI 

RLRSTSNSRSRSP1QRQSGTVYIINSQRESRPV 

QQTTRRSVRRRGRTRVFLEQDRERERRGTAY 

TPFSNSR1..VSRTTVEEGEESSRSSTAVRRHPTIT 

LDLQVRVRJRPGENRDRDSIANRTRSRVGLAE 

NTVTIESNSGGFRRTISRLERSGIRTYVSTITVP 

LRRISENELVEPSSVALRSILRQIMTGFGELSSL 

MEADSESELQRNGQHLPDMHSELSNLGTDN 

NRSQHREGSSQDRQAQGDSTEMHGENETTQP 

HJ KNMJbKOGRQL.KNrNNLVhl GTLPILRJ^AH 

FFLLNESDDDDRiRGLTKEQIDNLSTRHYEHN 

SIDSELGKICSVCISDYVTGNKLRQLPCMHEF 


556 


1906 


A 


4212 


3 


462 


LQRQRQHPAAAPAVPVRCFTFCFTDIV1MPKR 
KSPENTEGKDGSKVTKQEPTRRSARLSAKPA 
PPKPEPKPRKTSAKKEPGAK1SRGAKGKKEEK 
QE AGKEGTAPSEN GETICAEEIHISRST VN VST 
SRGTPPSTLSVKGQIETVRVKGTEN 


557 


1907 


A 


4213 


774 


507 


ARRFSCLTLQTSWGHRH\GPPRP\ANFVFLVET 

GFLHIGQAGHKLPTSGDPPASASQSARITGMS 

HRTWFLASFLIDSCKNFIVYKIMYTL 


558 


190S 


A 


4225 


3 


1253 


TYRHAEREHPETSSATKVSYDYRHKRPKXLD 

G DQDFSD GRTQK YCKEEDRK Y S FQKGPLNRE 

LDCFNTGRGRETQDGQVREPFKPSKKDSIAC 

TYSNKNDVDLRSSNDKWKEKKXKEGDCRKE 

SNSSSNQLDKSQKLPDVICPSPINLRKKSLTVK 

VDVIvKTVDTFRVASSYSTERQMSHDLVAVG 

RKSENFHPVFEHLDSTQNTENKPTGEFAQEHT 

UHQVK^NYFPSPGITLHEIU^S\0>IADIHKADV 

NErPLNSDPEIHRRIDMSLAELQSKQAVIYESE 

QTLIKI1DPNDLRHDIEIUIRKERLQNEDEH.IFHI 

ASAAERDDQNSSFSKNYTTQRKDIITHKPFEV 

EGNHRNTR VRPFK SNFRG GRCQPNYJCSGL VQ 

KSLYIQAKYQI^LRFTGPRGFITHKFRERLMRK 

KKVP 


559 


1909 


A 


4235 


1 


323 


KFSIPFFLRWSFTLVNPRLEGNDMISVHCNLGL 
LGLSHSPASASQVGGITGTQHHTGLIFGFLIET 
EFHHVGQAGLELLTSGDPPALAFQSAGITGVS 
HHAWLQVLNS 


560 


1910 


A 


4246 


2 


1569 


TLSLLERVLMKDIVTPVPQEEVKTVIRKCLEQ 

AALVNYSRLSEYAKIEGICKREMYELPVFCLA 

SQVMDLTIQNQKDAENVGRLITPAKKLEDTIR 

LAELVIEVLQQNEEHFIAEAFAWWSDLMVEH 

AETFLSLFAVDMDAALEVQPPDTWDSFPLFQ 

LL\NDFLRTGLLICGNGK\FHKHLQDLFAPLVV 

IVYMVVDLDGSSPIAQSIHRGLLSRESWEPNHW 

G SGTSEDLF WKLD ALQTFIRDLI IWPEEEFGK 

HLEQRLKLMAS DMIESC VKRTR\I AFE VKLQK 

TSSIQQIFRVPQFNMAPCFNVMGLMAKGSIQP 

K] \P^K/fPMOOPFA rv MWHOYH'sVinPI 1PPTV 

IV-L^ \v^OiVlX_/lVi VJ AVrv_lvl Wily I n j^- i I J Ei 1 - 1 C I f 1 V 

ICEMITLLVAKFVTILEGVLAKLSRYDEGTLFS 

SFLSFTVKAASKYVDVPKPGMDVADAYVTF 

VRHSQDVLRDKVNEEMYIERLFDQWYNSSM 

NVICTWLTDRMDLQLHIYQLKTLIRNIVKKTY 

RDFRLQGVLDSTLNSKTYETIRNRLTVEEATA 

SVSEGGGLQGISMKDSDEEDEEDD 


561 


1911 


A 


4257 


1300 


654 


SELVQFLLIKDQKKIPIKRADILKHVIGDYKDI 
FPDLFKJlAAEI^QYVTGYIvLVELEPKSNTYIL 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID "NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
coi respond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

T"\ A onorftn A r*iA P^frllltflm \C Add 

LJ — /YSparilC /•vClCl, E< — uiuuumi. y-vwiw, 

F=Phenylalanine, G-Glycine, H=Histidine, 
I=Isolcucine 3 lOLysine, L=Lcucinc, 
M-Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

T=Tbrpnnine V=Valine W=TrVDt0Dhan 

Y= tyrosine, X=Unknown, *=Stop codon, 
/possible nucleotide deletion, ^possible 
nucleotide insertion 














INTLEPVEEDAEMRGDQGTPTTGLLMIVLGLI 

FMKGNTIKETEAWDFLLAUGVYPTKKHLIFG 

DPKKUTEDFVRQRYLEYRR1PHTDPVDYEFQ 

WGPRTNLETSKMKVLKFVAKVHNQDPKDW 

PAQYCEALADEENRARPQPSGPAPSS 


562 


1912 


A 


4260 


1 


1498 


MVTWLYRFLPTSNMAAKLRSLLPPDLRLQF 

WLHARJLQKCFLSRGCGSYCAGAKASPLPGK 

MAMGLMCGRRELLRJLLQSGRRVHSVAGPSQ 

WLGKPLTTRLLFPAAPCCCRPHYLFLAASGPR 

SLSTSAISFAEVQVQAPPVVAATPSPTAVPEV 

ASGETADVVQTAAEQSFAELGLGSYTPVGLI 

QNLLEFMHVDLGLPWWGAIAACTVFARCLIF 

m T\/Tr.nT?P a a P IHTJHT PFIOTCFSSRTRFA1CLA 

GDH1E Y YKAS SEM AL YQ1CKHG1KL YKPLILPV 

TQAPIFISFFIALREMANLPVPSLQTGGLWWF 

QDLTVSDPIYILPLAVTATMWAVLELGAETG 

VQ S SDLQ WMRNVIRMMPLITLPITMHFPT A V 

FMYWLSSNLFSLVQVSCLRIPAVRTVLKIPQR 

\nmni rvtrT PT>i?Pf";FT P ^ FIT W H WKN A FMTRO 
V V rlJLJl^J^iVLdr i JKJDVJF JL»l-»or IViVvj vv rvi \ y-vi^ivi i 

LREREQRMRNQLELAARGPLRQTFTHNPLLQ 
PGKDNPPNIPSS\SSSSSKPKSKYPWHDTLG 


563 


1913 


A 


4265 


623 


116 


MG GL APTQTLEPTNRE YQNTQLS VS YLLPEQN 
TMOTRPTT ^OPSNNI PT PI SSSATMPSMOCK 
HRSPNGGLFRQSPVK/TPPIPMSFQPVPGGVXL 
PRGSGNPPHGTSILTAPPALLPHPPTHPTQQSF 
LIQENNNTNHTHSHTHTYTETLSFFLYICVNN 
FiR MFWOK WF 


564 


1914 


A 


4270 


3 


368 


ILKRKLSSLNSEVSTIQNTRML.AFKATAQLFIL 
GCTWCLGLLQVGPAAQVMAYLFTIINSLQGF 
FIFLVYCLLS\QQVQKQYQKWFREIVKSKSES 
ETYTLSSKMGPDSKPSEGDVFPRTSE 


565 


1915 


A 


4288 


83 


406 


RNSRPLWCSPPASQPRQAPVSQSCCCPLPSSSS 
PPS ALLAPTKPRALGTLRL YECSPEL GTTMLP 
PAWLLMLCQAPRPQDPDPRLTQPEKSLQEAP 
GQTGASRTPRT 


566 


1916 


A 


4298 


1041 


229 


GVRYLTLTFTCSTPWAESSTKFRHHMYTNVS 

GLTSFGEKWEELNRLGMMIDLSYASDTLIRR 

VLEVSQAPVIFSHSAARAVCDNLLNVPDDILQ 

LLKKNGGIVMVTLSMGVLQCNLLANVSTVA 

DHFDHIRAVIGSEFIGIGGNYDGTGRFPQGLAE 

DVSTYPVLIEELLSRSWSEEELQGVLRGNLLR 

VFRQVEKVREESRAQSPVEAEFPYGQLSTSCH 

FHLGASEWTPRLLIWR 


567 


1917 


A 


4299 


1 


1106 


GATPLGSVGGRTGKMDAATLTYDTLRFAEFE 
DFPETSEPVWILGRKYSIFTEKDEILSDVASRL 
WFTYRKNFPAIGGTGPTSDTGWGCMLRCGQ 
urcAOAT VPRH1 np.nWRWTORKROPDSYFS 
VLNAFroRKDSYYSIHQIAQMGVGEGKSIGQ 
WVnPrJTVAOVT KKT AVFDTWSSLAVH1AMD 
NTVVMEEIRRLCRTSVPCAGATAFPADSDRH 
C^GFPAGAEVTNRPSPWRPLVLLIPLRLGLTD 
INEAYVETLKHCFMMPQSLGVIGGKPNSAHY 
FIGYVGEELIYLDPHTTQPAVEPTDGCFIPDES 
FHCQHPPCRMSI AELDPSI A VVRGGI ILSTQ AF 
G AECCLGMTRKTFGFLRFFF SMLG 


568 


1918 


A 


4300 


2012 


1S43 


SRKFLTITPIVLYFLTSFYTKYDQIHFVLNTVS 
LMSVLIPKLPQLHGVR1FGINKY 


569 


1919 


A 


4302 


186 


531 


WTFCLFL/WWVPESARWLLTQGHVKEAHRY 
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SEQ ID 

nucl- 
eotide 

uence 


SEQ ID 

peptide 

seq- 

uence 


Met 

ntMj 


SEQ 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 

C o r iv* cn C\ n rt i 
vUJ 1 CojJUl 1 LI 1 

ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 

It J IdM dlllllKJ 

acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIanine C=Cysteine ; 
D=Aspartic Acid, b=Cjlutamic Acid. 
F^Phenylalanine, G^=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
ivi — ivicuiiuitine, in — .rtspdrdginc, r — jrroiine, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V- Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon. 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














LLHCARLNGKPVCEDSFSQBVRVNVCVSMHI"*" 
C V W WG VGC VKCLPPRAHHI WQEKPLGPHRT 
VTESKLEAEGKTKEKAREKERKKKS 


570 


1920 


A 


4308 


3 


869 


RSGQGKVYGLIGRRRFQQMDVLEGLNLLITIS 

v-i rvKJ \ fvL, K V Y YJLb WL^KNlvli^rijNiJJj^tVliKJv 

WTTVGDMEGCGHYRVVKYERI1CFLVIALKSS 
VEVYAWAPKPYHKFMAFKSFADLPHRPLLV 
DLT VEEGQRLK VIYG S S AGFHA VD VD SGN S Y 
DIYIPVHIQSQITPHAIIFLPNTDGMEMLLCYE 
DEGVYVNTYGRIIKDVVLQWGEMPTSVAYIC 

blxl^lJVlLj WLriiivALblKci Vb I UHLDO Vr MiiKRA 
QRLKFLCERNDKVFFASVRSGGSSQVYFMTL 
NRNCIMNW 


571 


1921 


A 


4309 


9 


524 


ASREMDVTKVCGEMRYQLNKTNMEKDEAE 
KEHREFRAKTNRDLEIKDQEIEKLRIELDESK 
QHLEQEQQKAALAREECLRLTELLGESEHQL 
HLTRQEKDSIQQSFSKEAKAQALQAQQREQE 
LTQKIQQMEAQHDKTENEQYLLLTSQNTFLT 
KLKEECCTLAKKLEQISQ 


572 


1922 


A 


4318 


1 


1119 


GATPLGSVGGRTGKMDAATLTYDTLRFAEFE 

DFPETSEPVWILGRKYSIFTEKDEILSDVASRL 

WFTYRKNFPAIGGTGPTSDTGWGCMLRCGQ 

MIFAQALVCRHLGRDWRWTQRICRQPDSYFS 

VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQ 

WYGPNTVAQVLICKLAVFDTWSSLAVHIAMD 

NTVVMEEIRRLCRTSVPCAGATAFPADSDRH 

CNGFPAGAEVTNRPSPWRPLVLLIPLRLGLXT 

DINEAYV\ETL\KHCFHGWPQFPG/WHREGK 

PNSAHYFIGYVGEELIYLDPHTTQPAVEPTDG 

CFIPDESFHCQHPPCRMSIAELDPSIAVVRGGH 

lwo 1 ^ArOAbCC-LOM 1 KK lrOr LKxFF SMLG 


573 


1923 


A 


4333 


363 


1066 


GGVPVGLASKPFQ1LYGHTNEVLSVGISTELD 
MAVSGSRDGTVIIHTIQKGQYMRTLRPPCESS 
LFLTIPNLAISWEGHIWYSSTEEKTTLKXERM 
HY1CFSINGKYLGSQILKEQVSDICIIGEHIVTG 
SIQGFLSIRDLHSLNLSINPLAMRLPIHCVCVT 
KEYSfflLVGLEDGKLIWGVGKPABVKPSISN 
FISHAVGDYFGSPSFQLIEKSPLGINKLKAKFD 
FSKGSK 


574 




A 






1 O 1 A 


ML) 1 Lbh. V I W ArNGb 1 ALFPPLAPN1SVPHRCLL 

LLYEDIGTSRVRYWDLLLLIPNVLFLfFLLWK 

LPSARAKIRITSSPIFITFYILVFVVALVGIARA 

VVSMTVSTSNAATVADKILWEITRFFLLAIEL 

SVIILGLAPGHLESKSSIKRVLAITTVLSLAYSV 

TQGTLEILYPDAHLSAEDFNIYGHGGRQFWL 

voiCrfr jl v i iL V VlLrxv 1 r^N-bl\JbL,r' bKK.br i 

VYAGELALLNLLQGLGSVLLCFDIIEGLCCVD 

ATTFL YFSFF APLI YVAFLRGFFG SEPKI LF 


575 


1925 


A 


4360 


2038 


1512 


GCWWRHPWLASQRDCLDCRIQLAEKFVKAV 

SKPSRPDMNPIRVKEVYRLEEMEKIFVRLEM 

KI1KGSSGTPKLSYTGRDDRHFVPMGLYIVRT 

VNEPWTMGFSKSFKKKFFYNKICTKDSTFDLP 

ADSIAPFHICYYGRLFWEWGDGIRVHDSQKP 

QDQDKLSKEDVLSFIQMHRA 


576 


1926 


A 


4365 


69 


500 


QVEGRQGREVKRTAWRISPVWRPARCRRRST 

PQP/PE/PGAQQQERHRQGEAPMQALDPRAEP 

GPQAQSrL\ACQPEPEPPRVLLDPTAARGGVQ 

GRP/GLSRHPGLAPHPQTHTPWPQSGRLPCAS 

EPLPLGGIRPTPGLEPKGRDLM 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C-Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
^Phenylalanine.. G=Glycine, H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Metniomne, N^Asparagine, r— rronne. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, \V=Tryptophan, 
y~l yrosine, A— urucnown, — oiup cuuun, 
/=possible nucleotide deletion, \=possib!e 
nucleotide insertion 


577 


1927 


A 


4366 


785 


502 


CADDi/m/wri^T t?t cpd 1 k' <J Qfi A 1 WVI-KTPTT W 
oArrlvMUNUVLrLjrKLKooUAl w vrio lriLW 

ASSNSRASTPKVAGITGARPHARIIFVFLIEMG 

FHNVGQAGL/DTLTLVICPPQPPKLLGLQM 


578 


1928 


A 


4367 


1 


221 


FFFFLKKSRCVTQAGVQGVPISLHPPPPGFKRF 
SGWSRTPDLR 


579 


1929 


A 


4383 


1 


224 


FETESHSVTQAGMQWHNLGSLQPMP/PGLKR 
FSCLRLQSSWDHRHAPPHLAHFCIFSRDGVSP 
CWPGWSSTPDLK 


580 


1930 


A 


4397 


410 


94 


SRLKPYSTNVTAKKLPATNIPNLDCFTAKLYQ 
WFKKGIMHILHELFQNKEEGAFPNS/FYEASFT 
LRPKSDRDIAKEESYSTISLLSTDTKILMSKYK 
QLKSSDL 


5S1 


1931 


A 


4414 


670 


3 


VLVHRQCGGILRLRRKEAVSVLDSADIEVTDS 

RLPHATIVDHRPQHRWLETCNAPPQLIQGKA 

RSAPKPSQASGHFS VEL VRG Y Aur OL 1 LOub 

RDVAGDTPLAVRGLLKDGPVAQRCGRLEVGD 

LVLHINGESTQGLTMIAQAVERIRAGGPQLHL 

VIRJU^LETHPGKr Kaj V UJbrlvrvO V v ro w rur^or 

DPGGPEVTGSRSSSTSLVQHPPSRTTLKKTRG 

SPE 


582 


1932 


A 


4424 


194 


449 


VLYIRJ<KKRLEKXRHQLMPMYNFDPTEEQDE 

LEQELLEHGRDAASVQAATSVQAMQGKTTL 

PS\QGPLQRPSRLVFT\DVANAIHV 


583 


1933 


A 


4435 


1 


166 


APGPPVPPPGSPPEQMPGPCPASMPP/DPPPGS 
PPEQMPGPCPVSAPP/GPPPGaPrfcv^Mrur'urV 
SAPPALLQDTSV 


584 


1934 


A 


4439 


1 


628 


SATPQQPSAPQHQGTLNQPPVPuMDbbMbY^ 

APPQQLPSAQPPQPSNPPHGAHTLNSGPQPGT 

APATQHSQAGPATGQAYGPHTYTEPAKPKK 

GQQLWNRMKPAPGTXEVSSSTSRSDPLLLPPR 

ALAPTQRASTVVLAPSPT/SEKVQNHSGSSAR 

GNLSGKPDDWP/LGHERVCGALLHRL*VGGG 

QGPHGKAAQGGAAGAAAGRLGLYH 


585 


1935 


A 


4463 


10 


144 


HKPVTNSRDTQEVPLEKAKQVLKIIATFKHTT 
SIFDDFAHYEKRQ 


586 


1936 


A 


4464 


1309 


103 


LNAESYVSFTTKLDIPTAAKYEYGVPLQTSDS 

FLRFPSSLTSSLCTDNNPAAFLVNQAVKCTRK 

INLEQCEEIEALSMAFYSSPEILRVPDSRKKVPI 

TVQSIVIQSLNKTLTRREDTDVLQPTLVNAGH 

FSLCVNVV1,EVKYSLTY 1 DACjfrv 1 kAULor v 

LGTVSSVWPLQQKFEIHFI.QENTQPVPLSGN 

PGYWGLPLAAGFQPHKGSGIIQ'ITNRYGQLT 

ILHST 1 iiQDuLALrliCj VK IrVLr O i 1 lviA^c>u^rv 

LRLTGALPCQLVAQKVKSLLWGQGFPDYVA 

PFGNSQGP/ADMLD WVPIHFITQSFNRKDSCQ 

t -r»/~i at xrrExnswrTzr v<^QT T KTP Pv A T<T TV7\J VT A TOT T 

SSSFPEANSGNERTILISTAVTFVDVSAPAEAG 
FRAPPAINARLPFNFFFPFV 


587 


1937 


A 


4471 


614 


387 


LLGRASAC/LQLQSSW/D/HRPMLPYLANFVF 

CKDRVSFTWLPRLVLNSWLQVILLPWPPTGCD 

NKHEPPCPATKRRHSGSI 


588 


1938 


A 


4480 


1720 


1458 


HDLGSLQPPPPGFKRESCLSLPSSWDYRLMPP 
CPANFCIIII/DFLVETGFHHVGQASHELLTSGD 
PPTSASQSAGITGMSYHTWFGES 


589 


1939 


A 


4487 


922 


332 


APVTTSPRVGQPW/RTALALRSLYRARPSLRC 
PPVELPWAPRRGHRLSPADDELYQRTRISLLQ 
REAAQAMYIDSYNSRGFMINGNRVLGPCALL 
PHSVVQWNVGSHQD1TEDSFSLFWLLEPRIEI 
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SEQ ID 

VIA. 

nucl- 
eotide 
seq- 
uence 


SEQID 

via. ~r 

in w. or 
peptide 
seq- 
uence 


Met 

nOCJ 


SEQ 
in 

USSN 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid. E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine, K=I,ysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V=Valine 3 W^Tryptophan, 
Y=Tyrosine, X=Unknown. *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














VVVGTGDRTERLQSQVLQAMRQRGIAVEVQ 
DTPNACATFNFLCHEGRVTGAALIPPPGGTSL 
TSLGQAAQ 


jy\j 


i y^v 


A 




i 


4 /Z 


rrrrt 1 boKbVAQAOVQWKJDLOSLQArrPCir I 

PFSCLSLPSSWDYRRPPLRPANFFVFLVETGFP 

RFSRDGLDLLT/S/GDPPTSASQSAGITGVSHR 

ARPKRIGEPRRKCGNAVVWPSTSLGDHRVTS 

VPHQGGLPGPIRVAPSSAGQREASQGPPGR 


591 


1941 


A 


4495 


1444 


1116 


I AARFTL AKT WN QLKRP\TMID S I KKTR\ YI YT 
MEYYADTERNEIMSFVAGTWVELEAIILSICLM 
LKDNWVEDTlPQGAVPCfATAEGMKRLLFAL 
EPWDSSCFPHPSSGV 


592 


1942 


A 


4496 


2 


919 


RTRPLFSGRPTRPVCTMSDERRLPGSAVGWL 

VCGGLSLLANAWGILSVGAKQKJCWKPLEFL 

LCTL AATHMLN V A VPI ATYS VVQLRRQRPDF 

EWNEGLCKVFVSTFYTLTLATCFSVTSLSYHR 

M WM VC WP VN YRLSN AKKQAGHTVMGIWM 

GSFILSALPAVGWHDTSERFYTHGCRFIVAEI 

GLGFGVCFLLLVGGSVAMGVICTAIALFQTL 

A VQ VGRQ ADHRAFTVPTI V VEDAQGKRRS SI 

DGSEPAKTSLQTTGLVTTIVFIYDCI.MGFPVL 

GPFSLADTHLSDLPYTWGDRDSGGACVM 


593 


1943 


A 


4506 


2 


193 


FFFEAESCSVPQAGVQRPDLGWLHAPPPXGSC 
HFPASASQVAGTTFLARHHTQLIF\AFLVENGL 
C 


594 


1944 


A 


4507 


1327 


647 


KMAGGVRPLRGLRALCRVLLFLSQFCILSGG 
ESTEIPPYVMKCPSNGLCSRLPADCIDCTTNFS 
C I YGKPVTFDCAVKPSVTC VDQDFKSQKNF1I 
NMTCRFCWQLPETDYECTNSTSCMTVSCPRQ 
RYP ANCTVR\DI I VMCLGNRTFPKML YCN WT 
GGYKWVYGLWLLRHHPRWGLGADRFXYLGP 
VAGTASGKLFSFGGLG1WTLIDVLLIGVGYVG 
PADGSLYI 


595 


1945 


A 


4512 


533 


264 


FFFKMESYSVARLECSGAISAPCNLHLLGSNN 
SPASASRV/AGNIGARHHTQQIFVLLVQMRVH 
Y VG QDGLDLL/NLMIHPPRSPK VLGLQ A 


596 


1946 


A 


4513 


3 


1674 


HASDHLYPNFrAnvJELrLKQKQRFEEKRFKLD 

HS VS STNGHR WQIFQD WLGTDQDKLDLAN V 

NLMLELL VQ KICK QLE AE SHAAQLQ ILMEFLK 

VARRNKREQLEQIQKELSVLEEDIKRVEEMS 

GLYSPVSEDSTVPQFEAPSPSHSSIIDSTEYSQP 

PGFSGSSQTKKQPWYNSTLASRRKRJLTAFIFE 

DLEQCYFSTRMSRISDDSRTASQLDEFQEC\LS 

KF\TRYNSVRPL\ATLSYASDLYNGSQYKSLV 

FEFDRDCDYFAIAGVTKKIKVYEYDTVIQDA 

VDIHYPENEMTCNSKISCISWSSYHKNLLASS 

dyegtvilwdgftgqrskvyqehekrcwsv 
dfnlmdpkllasgsddakvklwstnldnsv 

ASjEAKANVCCVKFSPSSRYFn^AFGCADHCV 

HYYDLKNTKQPIMVFKGHRKAVSYAKFVSG 
FFTV^ASTD'sOT K] WMVnkTPWPT RSFKnUTN 

DDI V O/AO J UOyLlvL W IN V VJP\_i \ J \^JLl\OrJVVJrlllN 

EKNFV\GLASNGDYIACGSENNSLYLYYKGLS 
KTLLTFKFDTVKSVLDI<X)RKEDDTNEFYSAV 
CWRALPDGESNVLIAANS\QGTI\KVLELV 


597 


1947 


A 


4518 


536 


824 


RSLALSPGLECSGM1SAHCNLHLLGSSDPPTS 
ASQVAEITSVRHHTWLIFCIVLGQMGFHHVGE 
QAGLELLTSWDPAILPSQSAGIIGMSPHAWPP 


598 


1948 


A 


4524 


1 


384 


FDTEFVNIGGDFDAAAGVFRXCRLPGAYFFSF 
TLGKLPRKTLS VKLN4KNRDE VQAMHT)DGS S 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alamne OCysteine, 
D=Aspartic Acid, b^Uiutamic /vcic, 
F-Phenylalanine, G-Glycine, H-Histidine, 
l=lsoleucine, K=Lysine, L=Leucine, 
fy^Jvietnionine, i\— /\spdrdgi»c, r i iuun^, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosinc, X=Unknown, *=Stop codon, | 
/=possible nucleotide deletion, \=possible 

niiCJCOilUw UlbtlLlUll 
















RRREMQSQSVMLALRRGDAVWLLSHDHDG 
YGAYSNHGKYITFSGFLVYPDLAPAAPPGLG 
ASELL 




599 


1949 


A 


4526 


366 


776 


MGQPAPYAEGPIQGGDAGELCKCDFLVFTSP 

-n.tt>-c v \ //^i t? a ptd a \/tpota WPON/TF^sP^T/AOAG 
NrEAVCvilALi 1 r Alvlr 1 /\ W jvV^ivijdo^oi/ j-yv^awj 

VQWRDPGSLHPPPLGFKRFSCJLSLPSSWDYK 

HAPPHPANFCIFSRDQVSPCWPGWSRSLDLVI 

PPPWLPKVLGLQA 




600 


1950 


A 


4529 


776 


334 


FFFETESCYVAQAGVQWCDLCSLQAPPPG\SS 
DPPASASRVAGTTGARHHTQLIFVFLVETGFH 
\MLARDGLKLLTSSDPPASASQSSWDYRREPP 
RLANFFVFLVETGSRYVAQAGVQWLFTGAIP 
LLISTGVLTCSVSDLGRFTPP 




601 


1951 


A 


4533 


1460 


403 


HEVQESIHFLESEFSRGISDNYTLALITYALSS 

VGSPKAKEALNMLTWRAEQEGGMQFWVSSE 

SKLSDSWQPRSLDIEVAAYALLSHFLQFQTSE 

GlPlMR^LSRQRNbLCjOr Abl <^L>1 1 

EFAALMNTERTNIQVTVTGPSSPSPVKFLIDT 

IINRLLLQTAELADGTANGSV/SISANGFGFA1 

CQLNVVYNVKASGSSRRRRSIQNQEAFDLDV 

AVKENKDDLNHVDLNVCTSFSGPGRSGMAL 

MEVNLLSGFMVPSEAISLSETVKKVEYDHGK 

LNL YLD S VNETQFC VN IP A VRNFKV SNTQD A 

SVSIVDYYEPRRQAVRSYNSEVKLSSCDLCSD 

VQRLPSL 


602 


1952 


A 


4540 


1963 


295 


MRAPGRPALRPLPLPPLLLLLLSSPWGRAVPC 

VSGGLPKPANITFLSINMKNVLQWTPPEGLQG 

VKVTYTVQYFIYGQKKWLNKSECRNINRTYC 

DLSAETSDYEHQYYAKVKAIWGTKCSKWAE 

SGRFYPFLETQIGPPEVALTTDEKSISVVLTAP 

EKWKRNPEDLPVSMQQIYSNLKYNVSVLNT 

KSNRTWSQCVTNHTLVLTVALEPNTLYCVHV 

ESFVPGPPRRAQPSEKQCARTLKDQSSEFKAK 

IIFWYVLPISITVFLFSVMGYSIYRYIHVGXKEK 

HPVANLlLlYGxNEr DKKrr VFAXtKi v \ii\t i\ i 

NIS\DDSK1SHQDMSLLGKSSDVSSLNDPQPSG 

NLRPPQEEEE VKHLGY ASHLMEIFCD SEENT\ 

EGTSFTQQESLSRT1PPDKTVIEYEYDVRTTDI 

CAGPEEQELSLQEEVSTQGTLLESQAALAVL 

GPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEE 

EPSTTLVDWDPQTGRLCIPSLSSFDQDSEGCE 

rtcir?r*r\r^T r'CEn t CD T vxrpp APnTJPPrrFNFTY 
PSEODoJLoJbb.ULL.orCJL. i licr /ax i^jvrr vj.c,in.c, i i 

LMQFMEEWGLYVQMEN 


603 


1953 


A 


4543 


3 


600 


" YSAVEFVEQASGISDWWNPALRKRMLSDSGL 
GMIAPYYEDSDLKDLSHSRVLQSPVSSEDHAI 
LQAVIAGDLMKLIESYKNGGSLLIQGPDHCSL 
LHYAAETGNGEIVKYILDHGPSELLDMADSE 
TGETALHKAACQRNRAVCQLLVDAGASLRKX 
TDSKGKTPQERAQQAVGDPDLAA/YTIESRQN 
YKVIGHEDLETAV 


604 


1954 


A 


4D4v> 


J 


938 


" QDNKVQNGSLHQKDTVHDNDFEPYLTGQAN 
QSNSYPSMSDPYLSSYYPPSIGFPYSLNEAPW 
STAGDPPIPYLTTYGQLSNGDIIHFMIIDAVFG 
QPGGLGNNIYQHRFNFFPENPAFSAWGTSGS 
QGQQTQSSAYGSSYTYPPSSLGGTWDGQPG 
FHSDTLSKAPGMNSLEQGMVGLKIGDVSSSA 
VKTVGSVVSSVALTGVLSGNGGTNVNMPVS 
KPTSWAAIASKPAKPQPKMKTKSGPVMGGG 
LPPPPIKHNMD1GTWDNKGPVPKAPVPQQAP 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
sea- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

111 1 11 lg 

nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 

nuclpfitidp 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteinc 3 

-L-' /VDJJdi C — VJ 1 (J Loll 1 1 /A CI (J, 

F=PhenyIaIanine. G=Glycine, H=Histidine, 
I=Isolcucine K==I v^ine I ^Leucine 
M=Methionine s N=Asparagine, P=Proline ; 
Q=G I utamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, W^Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














SPQAAPQPQQVAQPLPAQPPALAQPQYQSPQ 
QPPQ 


605 


1955 


A 


4553 


2 


2304 


ILLQEKRNCLLMQLEEATRLTSYLQSQLKSLC 

ASTLTVSSGSSRGSLASSRGSLASSRGSLSSVS 

FTDIYGLPQYEKPDAEGSQLLRFDLIPFDSLGR 

DAPFSEPPGPSGFHKQRRSLDTPQSLASLSSRS 

SLSSLSPPSSPLDTPFLPASRDSPLAQLADSCE 

GPGLGAJLDRLRAHASAMGDEDLPGMAALQP 

HGVPGDGEGPHERGPPPASAPVGGTVTLRED 

SAKRLERRARRISACLSDYSLASDSGVFEPLT 

KRNEDAEEPAYGDTASNGDPQ1HVGLLRDSG 

SECLLVHVLQLKNPAGLAVKEDCKVHIRVYL 

PPUDSGTPNTYCSKALEFQVPLVFNEVFRIPV 

HSS ALTLKSLQL YVCS VTPQLQEELLGI AQIN 

LADYDSLSEMQLRWHSVQVFTSVLNHQGRGR 

LGVQERAPPGTLHTPSPSPA/STDAVTVLLAR 

TTAQLQAVERELAEERAKLEYTEEEVLEMER 

KEEQAEAISERSWQADSVDSGCSNCTQTSPPY 

KVDKETNTEDLFLEEAASLVKJEI^PSRRARGSP 

FVRSGTIVRSQTFSPGARSQYVCRLYRSDSDS 

STLPRKSPFVRNTLERRTLRYKQSCRSSLAEL 

MARTSLDLELDLQASRTRQRQLNEELCALRE 

LRQRLEDAQLRGQTDLPPWVLRDERLRGLLR 

E AERQTRQTKLD YRHEQAAEK MLKKASICEI 

YQLRGQSHKEPIQVQTFRERIAFFTRPRJNIPPL 

PADDV 


606 


1956 


A 


4555 


3429 


776 


PGSGPGPAPFLAPVAAPVGGISFHLQIGLSREP 

VLLLQDSSGDYSLAHVREMACS1VDQKFPEC 

GFYGMYDKILLFRHDPTSENILQLVKAASDIQ 

EGDLIEVVLSASATFEDFQIRPHALFVHSYRA 

PAFCDHCGEMLWGLVVRQGLKCEGCGLNYH 

KRCAFKIPNNCSGVRRRRLSNVSLTGVSTIRT 

SSAELSTSAPDEPLLQKSPSESFIGREKRSNSQ 

SYIGRPIHLDKILMSKVKVPHTFVIHSYTRPTV 

CQYCKKLLKGLFRQGLQCKDCRFNCHKRCA 

PKVPNNCLGEVTINGDLLSPGAESDVVMEEG 

SDDNDSERNSGLMDDMEEAMVQDAEMAMA 

ECQNDSGEMQDPDPDHEDANRTISPSTSMNIP 

LMR VVQS VKFITKRKSST VMKEG WM VHYTS 

KDTLRKRHY\\nRJLDSKClTLFQNDTGSRYYKE 

IPLSEILSLEPVKTSALIPNGANPHCFEITTANV 

VYYVGENVVNPSSPSPNNSVLTSGVGADVAR 

MWEIAIQHALMPVIPKGSSVG'IG'raLHRDlSV 

SISVSNCQIQENVDISTVYQIFPDEVLGSGQFG1 

VYGGKHRKTGRDVAIKIIDKJLRFPTKQESQLR 

/\iLj\^ri Lnnru v V IN l^C^lVir H I r EK V r Y V ivl 

EKLHGDMLEMILSSEKGRLPEHITKFL1TQILV 
ALRHLHFKNIVHCDLKPENVLLASADPFPQV 
KLCDFGFARI1GEKSFRRSVVGTPAYLAPEVL 
RNKGYNRSLDMWSVGVIIYVSLSGTFPFNED 
EDIIIDQIQN AAFM YPPNP WKEI SHEAIDLIKN 
LLQVTCMRKRYSVDKTLSHPWLQDYQTWLDL 
RELECKJGERYITHESDDLRWEKY AGEQG1 ,Q 
YPTHLINPS A SHSDTPETEETEMKALGER V S1L 


607 


1957 


A 


4563 | 


1 


4499 


SRPWWLRASERPSAPSAMAKRSRGPGRRCLL 

ALVLFCAWGTLAWAQKPGAGCPSRCLCFRT 

TVRCMHLLLEAVPAVAPQTSILDLRFNRIREI 

QPGAFRRLRInXNTLLLNNNQIKRJPSGAFEDL 

ENLKYLYLYK^FQSIDRQAFKGLASLEQLYL 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Ammo acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E^Glutamic Acid, 
FHPhenylaianinc, G=Glycinc, H=Histidine, 
I=Isoleucine, K>=Lysme, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine ? 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














HFNQIETLDPDSFQHLPKLERLFLHNNR1THL 

VPGTFNHLESMKKLRLDSNTLHCDCE1LWLA 

DLLKTYAESGNAQAAA1CEYPRRIQGRSVATI 

TPEELNCERPRITSEPQDADVTSGNTVYFTCR 

AJEGNPKPEIIWLRNNNELSMKTDSRLNLLDD 

GTLMIQNTQETDQGIYQCMAKNVAGEVKTQ 

EVTLRYFGSPARPTFVIQPQNTEVLVGESVTL 

EC S ATGHPPPR1 S WTRGDRTPLP VDPR VNITPS 

GGL Y1QN WQGD SGE Y AC S ATNNIDS VHAT A 

FIIVQALPQFTVTPQDRWIEGQTVDFQCEAK 

GNPPPVIAWTKGGSQLSVDRRHLVLSSGTLRI 

SGVALHDQGQYECQAVNIIGSQKVVAHLTVQ 

PRVTPVFASIPSDTTVEVGANVQLPCSSQGEP 

EPAITWNKDGVQVTESGKFHISPEGFLTINDV 

GPADAGRYECVARNTIGSASVSMVLSVNVPD 

VSRNGDPFVATSIVEAIAT\0)RAIKSTRTHLF 

DSRPRSPNDLLALFRYPRDPYTVEQARAGEIF 

ERTLQLIQEHVQHGLMVDLNGTSYHYNDLVS 

PQYLNLIANLSGCTAHRRVNNCSDMCFHQKY 

RTHDGTCNNLQHPMWGASLTAFERLLKSVY 

ENGFNTPRGINPHRLYNGHALPMPRLVSTTLI 

GTETVTPDEQFTHMLMQWGQFLDIIDLDSTV 

VALSQARFSDGQHCSNVCSNDPPCFSVMIPPN 

DSRARSGARCMFFVRSSPVCGSGMTSLLMNS 

VYPREQTNQLTSYIDASNVYGSTEHEARSIRD 

LASHRGLLRQGIVQRSGKPLLPFATGPPTECM 

RDENESPIPCFLAGDHRANEQLGLTSMHTLW 

FREHNRIATELLKLNPHWDGDTIYYETRKIVG 

AEIQHITYQHWLPKILGEVGMRTLGEYHGYD 

PGINAGIFNAFATXAAFRFGHTLVNPLLLPGLD 

ENFQPIAQDHLPLHKAFFSPFRIVNEGGIDPLL 

RGLFGVAGKMRVPSQLLNTELTERLFSMAHT 

VALDLAAINIQRGRDHGIPPYHBYRVYCNLS 

/VAHTFEDLKNEIKNPEIREKLKRLYGSTLNID 

LFPALVVEDLVPGSRLGPTLMCLLSl QFKRLK 

DGDRLWYENPGVFSPAQLTQIKQTSLARILCD 

NADNITRVQSDVFRVAEFPHGYGSCDEIPRVD 

LRVWQDCCEDCRTRGQFNAFSYHFRGRRSLE 

FSYQEDKPTKKTRPRKIPSVGRQGEHLSNSTS 

A\FSTRSDASG\TNDFQR VCS WEMQKTITDLR 

TQIKKLESRXLSTTECVDAGGESHANNTKWK 

KDACTICECKDGQVTCFVEACPPATCAVPVNI 

PGACCPVCLQKJRAEEKP 


608 


1958 


A 


4566 


354 


1135 


FSFLC/GVSGRLGLDbbbDY Y 1 r QR V JJ V rKAL 
IIVAVQCGCDGTFLLTQSGKVLACGLNEFNKL 
GLNQCMSGIINHEAYHEVPYTTSFTLAKQLSF 
YKIRTIAPGKTHTAATDERGRLLTFGCNKCGQ 
LGVGNYKKRLGlNLLGGPLGGKQVrRVSCGD 
EFTI AA I DDNHli* AW ON Cj LrN uKLAM If 1 bKr 
HGSD1CTSWPRPIFGSLHHVPDLSCRGWHTILI 
VEKVLNSKTIRSNSSGLSIGTVFQSSSPGGGGE 


609 


1959 


A 


4567 


1 


412 


FFFFETESRSVAQAGVQWRDLGSLQAPPPGFT 

PFSCLSLPSSWDYRRPPLRPANFFVFLVETGF 

HRFSRDGLDLLT/S/GDPPASASQSAGITGVSH 

RARPRINLRNV1YSFAVTYCLNYISLAMSSTL 

KLSFHVLSGS 


610 


1960 


A 


4570 


697 


467 


ECRGVISAH\CCTLCLPSSSDSASAF\RVARTT 
GTCDYAQLIFAFLVEMGFHHVGQDGLHLL/N 
LVIRPPRPPKVLGLQA 
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SEQ ID 
NO: ox 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NU: or 
peptide 
seq- 
uence 


Met 
nod 


SEQ 

TJ~\ MA. 

ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
.location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteinc, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine, K=Lysine, L=Leucine, 
M=Methioninc, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W-Tryptophan, 
Y— Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


611 


1961 


A 


4571 


25 


1396 


ADPHTTVIRFFPAASATKRVLPPVLRVSSPRT 

WNPNVPESPRIPAPRLPKKMSGAPTAGAALM 

LCAATAVLLSAQGGPVQSKSPRTASWDEMN 

VLAHGLLQLGQG\CANT\GAHPQSAERAGA\R 

LSACGSACQGTEGSTDLPLAPESRVDPEVLHS 

LQTQLKAQNSRIQQLFHKVAQQQRHLEKQHL 

RJQHLQSQFGLLDHKHLDHEVAl^AilRlUR^ 

EMAQPVDPAHNVSRLHRLPRDCQELFQVGER 

QoGL> bIQrQubrrrLVNCKJWToDGGWTVIQR 

RHDGSVDFNRPWEAYKAGFGDPHGEFWLGL 

EKVHSITGDRNSRLAVQLRDWDGNAELLQFS 

VHLGGEDTAYSLQLTAPVAGQLGATTVPPSG 

i p\/dcctu fnAnunT ddhi'xtp a tx* o t CAA^i/n;r 

" GTCSHSNLNGQYFRSIPQQRQKXIOCGIFWKT 
WRGR Y YPLQ ATTMLIQPMAAEAA S 


612 


1962 


A 


4575 


162 


3 


FFFETESRSVAQAGVQWRDLSSLQPPPPGXSR 
GSPASASPVAGITGTRHHRTRG 


oli 


1963 


A 


A CO A 

4584 


^0*7 

6V7 


321 


PLAQRRPFLWVTVKTNGHIWGSSTYPHFWGS 
SNS/PASASQVAGIPNARHQARIIFVFLVEPRF 
HFIVGRAGLGFL/NLAICLPQHPKVLGLQACN 
LNIKPHPAHKYISMIQFNVHFMCMSVHIYI 


614 


1964 


A 


4589 


727 


299 


PGSAQSAQRGRGRRRARAGSATQITMYSFMG 
GGLFCAWVGTILLVVAMATDHWMQYRLSGS 
FAHQGLWRYCLGNKCYLQTDSIAYWNATRA 
FMILSALCAISGnMGIMAF/GWVAVLMTFFA 
GIr YMCA YRVHECRRLbl PR 


615 


1965 


A 


4590 


2 


414 


TILPEKIQAWAQKQCPQSGEEAVALVVHLEK 
ETGIU.RQQVSSPVHREICHSPLGAAWEVADFQ 
PEQVETQPRAVSREEPGSLHSGHQEQLNRKR 
ERRPLPKNARPSPWVPALADEWNTLHQEVTT 
I RLPAGbQbr VKD 


616 


1966 


A 


4592 


773 


488 


DFALVAQAGVQWHNLGSPQPLPPGFKRFSCL 
SLPSSWEYRCVPP/RLANFVFLVEMGFLHVGQ 
AGLELPTSGDPPALASQSAGITGVTTVPSGPG 


617 


1967 


B 


4595 


84 


478 


XRHGLREPLLERRCAAASSFQHSSSLGRELPY 

DPVDTEGFGEGGDMQERFLFPEYILDPEPQPT 

REKQLQELQQQQEEEERQRQQRRJEERRQQNL 

RARSREHPVVGHPDPALPPSGVNCSGCGAEL 

HCQDAR* 


618 


1968 


A 


4596 


2945 


1188 


ARSRNSARGVYGMCVDTLFLCFLEDLERNDG 

SAERPYFMCSTLKKPLARRCFPAIHAYKGVL 

MVGNETTYEDGHGSRKNITDLVEGAKKANG 

VLEARQLAMRIFEDYTVSWYWiriGLVIAMA 

MSLLSliLLHLLAGIMGWVMIIMEIVSELGYRIF 

HCYMEYSRLRGEAGSDVSLVDLGFQTDFRV 

YLHLRQTWLAFMIILSILEVIIILLLIFLRKRILI 

AIALIKEASRAVGYVMCSLLYPLVTFFLLCLCI 

AYWASTAVFLSTSNEAVYKIFDDSPCPFTAKT 

CNPETFPSSNESRQCPNARCQFAFYGGESGYH 

RALLGLQIFNAFlVIFFWnLANFVLALGQVTLAG 

A ITA QVVWA T PlfPnT^r PAJ7DT 1?C AX2C\T> at dvu 

TGSLAFGALILAIVQIIRVILEYLDQRLICAAEN 

Is^AKCLMTCLKCCFWCLEKFIK^LNRNAYIM 

IAn r GTNFCTSARNAFFLL\4RNIIRVAVLDKV 

TDFLFLLGKLLIVGSVGILAFFFFTHRIRIVQDT 

APPLNYYWVP1LTVIVGSYLIAHGFFSVYGMC 

VDTLFLCFLEDLERNDGSAERPYFMSSTLKKL 

LNKTNKJCAAES 


619 


1969 


A 


4601 


2 


357 


RTSVEPYILGEF/RKLSNNTKWKTEYKATEY | 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted | 

beginning 

nucleotide 

location 

correspond] 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phcnylalanine, G=G lycine, H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionme ? N=Asparagine, P=Proiine, 
Q=Glutamine 5 R=Arginine ; S=Serine, 
T-Threonine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possib!e 
nucleotide insertion 














GLAYGHFSYEFSNHRDVVVDLQGWVTGNGK 

GLIYLTDPQ1HSVDQICVFTTNFGKRGIFYFFN 

NQHVECNE1CHRLSLTRPSMEKPCKS 


620 


1970 


A 


4606 


1 


2415 


MERLWGLFQRAQQLSPRSSQTVYQRVEGPR 

KGHLEEEEEDGEEGAETLAHFCPMELRGPEP 

LGSRPRQPNLIPWAAAGRRAAPYLVLTALLIF 

TGAFLLGYVAFRGSCQACGDSVLVVSEDVN 

YEPDLDFHQGRL Y W SDLQ AMFLQFLGEGRL 

EDTIRQTSLRERVAGSAGMAALTQDIRAALS 

RQKLDHVWTDTHYVGLQFPDPAHPNTLHWV 

DEAGKVGEQLPLEDPDVYCPYSAIGNVTGEL 

VYAHYGRPEDLQDLRARGVDPVGRLLLVRV 

GVISFAQKVTNAQDFGAQGVLIYPEPADFSQ 

DPPKPSLSSQQAVYGHVHLGTGDPYTPGFPSF 

NQTQFPPVASSGLPSIPAQPISADIASRLLRKL 

KGPVAPQEWQGSLLGSPYHLGPGPRLRLVVN 

NHRTSTPINNIFGCIEGRSEPDHYWIGAQRDA 

WGPGAAKSAVGTAILLELVRTFSSMVSNGFR 

Pl^RSLLFlSWDGGDFGSVGSTEWLEGYLSVL 

HLKAVVYVSLDNAVLGDDKFHAKTSPLLTSL 

IESVLKQVDSPNHSGQTLYEQVVFTN\PSWD\ 

AEVIRPLPM\DSSAY\SFTAFVGVPAVEFSFME\ 

DDQ\AYPFLHTKEDTYENLHKVLQGRLPAVA 

QAVAQLAGQLLIRLSHDRLLPLDFGRYGDVV 

LRHIGNLNEFSGDLKARGLTLQWVYSARGDY 

IRAAEKLRQEIYSSEERDERLTRMYNVRIMRV 

EFYFLSQYVSPADSPFRHIFMGRGDHTLGALL 

DHLRJLLRSNSSGTPGATSSTGFQ\ESRFRRQL\ 

ALL\TWDACKGAANALSGDV\VNIDNNF 


621 


1971 


A 


4610 


793 


334 


ISRVDDFVGSGIANVIIAVAIFSIPAFARLVRG\ 

NTLVLKQQTFIESARS1GASDMTVLLRH1LPGT 

GSSIVVFFTMRIGTSHSAASLSFLGLGAQPPTP 

EWGAMLNEARADMVIAPHVAVFPALAIFLTV 

LAFNLLGDGLRDALDPKIKG 


622 


1972 


A 


4614 


2 


820 


LVYVMIAIFC1ASAMSLYNCLAALIHKIPYGQ 

CT1ACRGKNN4EVRLIFLSGLCIAVAWWAVF 

RNEDRWAWILQDILGIAFCLNL1KTLKLPNFK 

SCVILLGLLLLYDWFVFITPFITKNGESIMVEL 

AAGPFGNNEKNDGNLVEATGQPSAPHEKLPV 

VIRVPKLIYFSVMSVCLMPVSILGFGDIIVPGL 

LIAYCRRFDVQTGSSYIYYVSVXTVAYAIGM1L 

TFWLG\LMKKGQPALLYLVPCTLITA/CQFV 

AWETVREMKKFWERVTS 


623 


1973 


A 


4619 


17 


691 


TLVSVVEFVRRADLTREDLAPSSVDSGQAGF 
GGCCESGLPNTMPSAFSVSSFPVSIPAVLTQT 
DWTEPWI .MGLATFHALCVLLTCLSSRSYRLQ 
1GHFLCLVILWCAEYINEAAAMNWRLFSKY 
QYFDSRGMFISIVFSAPLLVNAMIIVVMWVW 
KTLN VMTDLKN AQERRKEKKRRRICED* GAA 
AAWSLRPSRPPSAAPSAAVCVAWASFQLTHG 
LKNRCr 1 


624 


1974 


A 


4622 


164 


668 


VSCYTALQSIMNQPESANDPEPLCAVCGQAH 

SLEENHFYSYPEEVDDDL1CHICLQALLDPLD 

TPCGHTYCTLCLTNFLVEKDFCPMDRKPLVL 

QHCICKSSILVNKLLNKLLVTCPFREHCTQVL 

QRCDLEHHFQTSQAWGTHL* SQLLGRLRQED 

CLSPGVHHCSEV 


625 


1975 


A 


4625 


474 


473 


CFLSPSPLLPPLLLSSSSSPSFPLPPPPTLLPSTLP 
PPLLIPSS*LSP 
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SEQ ID 
NU: ot 
nucl- 
eotide 
seq- 
uence 


SEQ ID 

NUt Oi 

peptide 
seq- 
uence 


Met 

i_ j 
nod 


SEQ 
in 

USSN 
914 


Predicted 
beginning 
nucleotide 
location 

nrirf PC rN r\ t-\ A \ 

ccirTCapuna i 
ng to first 
amino acid 
residue of 

neurit lH h» 

pepuue 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine C=Cysteine, 
D=Aspartic Acid, b=Oiutamic Acid, 
F^Phenylalanine; G=Glycine s H=Histidine 3 
I=Isoleucine, K=Lysine, L=Leucine, 
M = Methionine, N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan } 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/"possible nucleotide deletion, \ = possible 
nucleotide insertion 


626 


1976 


A 


4629 


249 


3 


KLKGNECFCYHCNVCIFLM1KJC*GLFLC*I YFI 
LFFET* SHSFTRLECSGTIS AHCSLQLQG SSNSP 
ASASQVAGIAGTHH 


627 


1977 


A 


4635 


1 


301 


FFFFETKPFFAPQAGGQGPSRGSLNPLPTGLK 
IvbRbONNGPIO^PrRVNFGILRGNGVP 
PGGAG*PRPPDLRGPPGLAPPQGGNNGGDPP 
ARAYL 


628 


1978 


A 


4648 


1357 


782 


I<CLFSSQRLFGPHIQAINPSFLLLSFFPS*LLAMR 

TVGNNAFILVFLVYRIVLLLF*HV*PAYFQPSK 

NKTAKINCN*RPF1,FLVCYLL*AELHIGIF1ANF 

YDCIPNKLNEHLWPKLLQSLIFHVDFCGFLHlv 

VFY1CFTEFLLFLYFL*LFIIKVSCSII*CSTICVF 

SYKSFAVIIFFVDNTRFFSFGF 


629 


1979 


A 


4660 


18 


999 


HHELHTLELLQNPKEVLTRSEIQDVNYSLEAV 

KVKTVCQIPLMKEMLKRFQVAVNLAEDTAH 

PKLVFSQEGRYVKNTASASSWPVFSSAWNYF 

AGWRNPQKTAFVERFQHLSCVLGKNVFTSG ! 

KHYWEVESRDSLEVAVGVCREDVMGITDRS 

KMSPDVGIWAIYWSAAGYWPLIGFPGTPTQQ 

EPALHRVGVYLDRGTGNVSFYSAVDGVHLH 

TFSCSSVSRLRPFFWLSPLASLVIPPVTDRK*G 

FSSPDQNSFPVVQLRDTHPWALFCPSCLYPG 

WS1FWVSLTVPFG1CPJLCASQEAVPWEVGLA 

NuDU 1 GNrrRRh WhlrL 


630 


1980 


A 


4669 


2 


358 


FFFFFETESHSVAQAGMQWRNLGSLPAPPPGF 
TPFFCLSLLNGWDYRRPPPHLANFFVLLVETG 

X2\XVW7 f fWl/^ 1 T r~\I T TP4CTDO \ PrvC A CTTT'AfCl.rP 

rHUVuQDuLULL I i roAbQSAEITGVSHC 
TRLKKIRFAKGHVEFFFESHVE 


631 


1981 


A 


4674 


953 


614 


TPIRGTDDEHEECTVQEYSAGKNTCLRPGAV 
AHTCNPCTLGGRGRWIT*GSGVQDQPGPTWQ 
NPVFLERRPRALHS SPGLTTQRIL W AQGL W V 
GAGSTGCSRGPRGEGVFREG 


632 


1982 


A 


4678 


34 


314 


RSTPLASGMISPSFGFMGHLLRLEFEILPSTPNP 
*LPSYQGEAAGSSL1SHLQTFSPDLKGVYCTFP 
ASGLAPVPTHWTVSELSRSPVATATFC 


633 


1983 


A 


4696 


1 


1365 


RTLGMEGERRASQAPSSGLPAGGANGESPGG 

GAPFPGSSGSSALLQAEVLDLDEDEDDLEVFS 

KDASLMDMNSFSPMMPTSPLSMINQIKFEDEP 

DLKDLFITVDEPESHVTTIETFITYRIITKTSRG 

EFDSSEFEVRRRYQDFLWLKGKLEEAHPTLII 

PPLPEKF1VKGMVERFNDDFIETRRKALHKFL 

NRIADHPTLTFNEDFKJFLTAQAWELSSHKKQ 

GPGLLSRMGQTVRAVASSMRGVKNRPEEFM 

EMNNFIELFSQKINLIDKISQRJYKEEREYFDE 

MJvb Y UrlrllL W bAbhbDL V U I LKD V ASCIDRC 

CKATEKRMSGLSEALLPVVHEYVLYSEMLM 

GVMKRRDQIQAELDSKVEVLTYKKADTDLL 

PEEIGKLEDKVECANNALKADWERWKQNM 

v^NUiPLLAr J DMAbbNLHY YbQCLATWESFLT 

SQTNLHLEEASEDKP 


634 


1984 


A 


4708 


421 


158 


SYWVGFDYTYKFFFVII TDPFHk' ATRRNPDTO 
WISKA V YKHREMCG LTSTGRKSHGLEKDRM 
FPHAIGGSCRAA*RRRKTLQFPCYH 


635 


1985 


A 


4709 


42 


341 


YTKQPDAKERRRTVHWKKETESEASE1TIPPST 
PGVPQAPGHWEDYGRGDNFYLPH*DPGGIVL 
WNIFNRMPIARKNITDGEHHEYLIEVPRLFHT 
SED 


636 


1986 


A 


4721 


2 


351 


EKPDHFFPEGTSFIHEPRRPN*GDLVHCLGG1S 
RSTTVTVA*LMQKLNLSMNDAYYIVIMKMSS 
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SEQ ID | 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Aianine C=Cysteine, 
D~Aspartic Acid, E=Glutamic Acid, 
^Phenylalanine, G=Glycinc, II=Histidine, 
I— Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine ; W=Tryptophan 9 
Y=Tyrosine 5 X=Unknown, *==Stop codon, 
/=possiDie nucieotiae aeieuun, \=pobMuic 
nucleotide insertion 














ISPNFNSMDQPLDFQRTLGLRSPCYNRVPAQK 
MYFTrPSNHNAYQVDSVQST 


637 


1987 


A 


4726 


664 


253 


NTGLTCSIQRKCGETQLYRREENRLILLLQDH 
LKSESFQVLTLSPRLEFSGLISAHCNLRLPGSS 
D S S A S S S RAACj 1 1 0 V i-iririA W L* ir r r L> v c l or 
HAG*AGLELLTSGDPPASASRSAGITGVSHHA 

RPRETRFL 


63 S 


1988 


A 


4734 


24 


592 


GGMDSRVSGTTSNGETKPVYPVMEKKEEDU 
TLERGHWNNKMbr VLb VAijrJMiUJLUJN v wiur 
YLCYKNGGGAFFIPYLVFLFTCGIPVFLLETAL 
GQYTSQGGVTAWRKICPIFEGIGYASQMIVIL 
LNVY Yll VLAWALF YLFSSFT1DLP WGGCYHE 
WNTEHCMEFQKTNGSLNGTSENATSPVIEFW 


639 


1989 


A 


4743 


1040 


699 


QGLTLLPRMECSATITAIICSLELPGSIDLPTSA 
S*VARTTGTHHHPWElLVLLi/ p l vvuo i i vav 
AGLELLGSSNLPAAMVSQSAQIIGHDHCAWA 
TSNHVLYTQEGLRRGKEG 


640 


1990 


A 


4771 


527 


2 


GRIDCPHPATVLAQPIFIDACSVLGAYQGAQN 

WIRRRPCLPSGCLKMNRE1GPLQHSLCCPGWS 

QTPGLKAJLLRQPPK*LGLQMbbHbUrrAW 

MARSRLTATSASQVQAILLPQPPGTTDSCSPS 

PDHEQQPLSWVLPPPQKDMNPREQQVALGP 

QAAALPWAVWRNDCFPR 


641 


1991 


A 


4780 


16 


473 


RPSSQCGGIPTGWKKGLAPELSSELSSPPLPAR 
LQLAASPYFSPSWAECPQPVPAGTHATWCLA 
RVWARMTPPGPAGlPbrirLrrrrrbKC) vriror 
FP ARD SGSRQGHSTDRYKHTDAPRD AHRRVP 
QRDTDTG VHTG SGTHTHAHTPPEK 


642 


1992 


A 


4798 


1 


487 


GYSFRCDIVDYSRSPTALRMARTCWLYYFSR 

F1ELLDTIFFVLRKKNSQVTFLHVFHHTIMPW 

TWWFGVKx AAGuLG 1 rrlALJLrN lAVriv visa i 

SYYGLSALGPAYQKYLWWKKYLTSLQLVQF 

VIVAIHISQFFFMEDCKYQFPVFACTTMSYSFM 

FLLLFLH 


643 


1993 


A 


4799 


2 


391 


LMAFIEMHlSGSLVYEKiKlKl i £> YraftAi^iNri.^ 
QEIPLSEILRISSPRDFTNISQGSNPHCFEIITDT 
MVYFVGENNGDSSHNPVLAATGVGLDVAQS 
WEKAIRQALMPVTPQASVCTSPGQGKDHSK 
Q * A S V CTSFG QolsJJri alv^J 


644 


1994 


A 


4800 


488 


101 

t 


AYPLFAVHPVHTECVAGVVGRAYLLCALFFL 
LSFLGYCKAFRESNKEGAHSSTFWVLLSIFLG 
AVAMLCKEQGITVLVRAATWLGPAFSVCPFP 
SYKDIWGWPCLCGYLHAYIPLLV 


645 


1995 


A 


4805 


458 


126 


" ' LLWTTVLCQTPARPQSTM1HLGHILFLLLLPV 
AAAQTTPGERSSLPAFYPGTSGSCSGCGSLSL 
PLLAGLVAADAVASLLIVGAVFLCARPRRSP j 
AQEDGKVYINMPGRG 


646 


1996 


A 


4817 


47 


1033 


LQGDTWrlLSFLSHFSRLHGGVPGRGLLEGNl. 

LQPQAPGHDMTSIPFPGDRLLQWGVILCGLT 

HKQAVQCLKGPGQVARXVLERRVFRb 1 QQC 

PSANDSMGDERTAVSLVTALPGRPSSCVSVT 

DGPIO : *SSN*KIUANGLGFSFVQMEKESCSFIL 

KSDLVRIKRLFPGHPAEENGA1AAGD1ILGRE 

WEGPRKAS SSRCRGS W AMQLS VQAGPSF AS 

YYPAAVEVLHLLRGAPQEVTLLLCRPPPGAL 

PELEQEWQTPELSADKEFTRATCTDSCTSPIL 

GSRGQLGGTVPPQMQGKAWGLRPESSQKAIR 

EGTMGAKTERDLGPVP 


647 


1997 


A 


4854 


1 1044 


335 


PRVRGDWPLEICKKSNSNIHPIFSWCGSTDSKD 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

Kecrinnino 
nucleotide 
location 
corresponds 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 

niiplpfitiHp 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

F=Phcnylalanine, G=Glycinc, H=Histidine, 
I=Isoleucine, K=Lysine, L=-Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine ; V=Valine, \V=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 

/ = no<Kiblp nnrlontiHo delt-tinn \s=rwj«ihl^ 

nucleotide insertion 














1VMPTYDLTDSVLETMGRVSLDMMSVQANT - 

GPPVVESKNSTAVWRGRDSRKJERLELVKLSRK. 

HPELIDAAFTNFFFFKHDENLYGPl\nCHISFFD 

FFKHKYQINIDGTVAAYRLPYLLVGDSVVLK 

QDS1YYEHFYNELQPWKHYIPVKSNLSDLLEK 

LKWAKDHDEEAKKIAKAGQEFARTsJNLMGD 

DIFCYYFQTFPRNMPIYK 


648 


1998 


A 


4867 


2030 


837 


AGMLPAVG S ADEEEDP AEEDCPEL VPMETTQ 
SEEEEKSGLGAKIPVTHTGYLGAGKTTLLNY1 
LTEQHSKRVAVILNEFGEGSALEKSLAVSQG 
GELYEEWLELRNGCLCCSVKDNGLRAIENLM 

ELGSDIYLDG11TIVDSKYGLKHLAEEKPDGLI 
NEATRQVALADAILINKTDLVPEEDVKKLRT 

TIRSINGLGQILETQRSR\rDLSNVLDLMAFDSL 

^HT^i OKifi nwvpnTnPT-TT nr><2TVTiT?rn\/P/": 
ovjioi^v^/jvisjuVs^ij v r vj i v^rriLi/vji v 1 i 1 ru vjtvj 

NAICEEHLNMFIQNLLWEICNVRNKDNHCMEV 

IRLKGLVSIK.DICSQQVIVQGVHELYDLEETPV 

SWICDDTERTNRLVLLGRNLDKDILKQLFiAT 

VTETEKQWTTHFKEDQVCT 


649 


1999 


A 


4873 


226 


IRQ 


FSCLSLRSSWD*KCAPPHPAFVFLVEMGFHRV 

GQAGLELRTSGDPPASASQSAGITGVSHLA+P 

TSMPLLPFQRECVYI 


650 


2000 


A 


4874 


2 


437 


FFFLRRSFAFVAQAGVQWCDLGSPQPLPPGF 
K*FSCLSLPSSWDYRHAPPPCPS*FLYF**RQG 
FTMLARLVLNS*PHDLPTSPSQSAEIKGVSHR 
CPASFYLFLKYYLEAKFCA+GECAPSAGVGA 
GYKRG HKSCLLINC VVQI 


651 


2001 


A 


4898 


1701 


771 


DAWGPETRLARILNPDSF1EPRPGRLPELEATR 

AALICLPLLVSKLLDIPGLEVAVVTTERAKHFY 

SPQDIPVTLYSDADEWEMWKSRSDPVLHIDL 

RRWADLLLVAPLDANTLGKVASGTCDNLLTC 

VMRA \VDRSKPLLFCPAMNTAM WEHPITAQQ 

VDQLKAFGYVEIPCVAKKLVCGDEGLGAMA 

EVGTIVDKVKEVLFQHSGFQQS*PGISVMGVP 

i Y c ?F\vvnAK'9VivMnvpiT^7nnvpi-n t kj^hpa 

j-» i ob vv v v tvivii^ v vjivivjvj 1 i XiJL>.L»rN vj LJ,r.r\ 

LSLPRGQACSRLNWTEGPGLSFFQPGEAAA 


652 


2002 


A 


4927 


1 


611 


FRGRQTSRPARGFSPWRPPGTMQEPSSGECPA 
SP*LPCASNR1 AFGGT IFPFAPI VPYPAPF^PI T 

PAFSCAPRPRAHTHSRTHPSAPLVPKPSSRAR 

GQSPIPSRASSPSCSWAQVPGVALARCAGVC 

KPGDSWRVAACISGRCCSRGRRRGSGPRNPE 

QSFRGAWGPSFWGSWKSQRELSAGGAQAWP 

LLGSAGSGLRGEA 


653 


2003 


A 


4965 


2 


283 


FFFFI* DGVSLCHPGWN AVARS WLTATSASR 
VOAVSCFRLPSSWDYRHATMPG*FF*YF**rc 
W GFTrLAIL VLNS * PQV1CPP WPPK VLTLQA 


654 


2004 


A 


4968 


3 


437 


RPGTPfiRRFRRSWFPOT P+FPFPHT F<sT ATPfTTk 
IPAVGLGALGVIPPVRVPQRPPTQRSQGRGW 
DPERDPGCRVQVSRGPRFGEQKTPGLQGCLP 
PPCLTHLAAA SCVVVWCGR WKRDSAECQCD 
HSCSAVSQQEDRCRSSSCS 


655 


2005 


A 


4983 


201 


397 


MNNNTTCIQPSM1SSMALPIIYILLCIVGVFGN 
TLSQWIFLTKIGKKTSTHIYLSHLVTANLLVC 


656 


2006 


A 


4988 


332 


159 


LVHKDMYREFFEEEAQASNKHVTRCLTSLVI 
REVHIKTMR* HFLPIRLEICNKNNIKD 


657 2007 


B 


5008 


129 


465 


MAGMKTASGDYIDSSWELRVFVGbEDPEAES 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteme s 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine ; P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T-Threonine, V^V aline, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 






I 








VTLRVTGESHIGGVLLKIVEQ1NRKQDWSDH 
AIWWEQKRQWLLQl HW 1 LDK Y LilL/VUAJU^r 
FGPQHRPVILRLPNRRALRLX* 


658 


2008 


A 


5017 


1 


292 


FFFFKETESHSVTQAGVQWHDI .GSLQPPPPGF 
KRF SCLSLLSS WD YRC APPHPANF VFLVETGF 

tt-tw-i t * y-v A OT 1/ 7 T TT -fc O A \IT f~* T CTCI RTDT IT IT T C 

HHVAQAGLKLLTL*SANLGLS 1 bLrlrLtlL,Lc> 


659 


2009 


A 


5018 


17 


338 


RGHGGKSLTGGTPGNWGDGLLVSEDWSHLIF 
T*NSLVSPVLGKWSPCLQGPGLSAVHTWPWL 
MAACWAVHVKTHMRPGLAVLPRLVLNSWS 
*AIILLWPPKALGLQA 


660 


2010 


A 


5028 


2 


310 


SRVDDFVGERRGGCDECLCGHRGLRAVPLG 
HPGHLCLQPPGGPA*FLDYCRGCCPHPVPGST 
AGSCPRQKKTTPGPTVLCVCSFWIYQRGEPH 
HRTGARWNH 


661 


2011 


A 


5050 


752 


431 ; 


RQSCSSTQAKVQWFHYGPLQSQPPGLKQSSQ 
LSLPN SRDHRH VPPRL AIFSF AETGSP YFAQ AS 
LELLGSSHPPTSASQSAR1TGVSHRAWPLK*F 
NLNQYQTLTMN 


662 


2012 


A 


5054 


48 


103 


ELNNGPFQMPLCNG GNLAVTG S W ADRSPLH 

EAASQGRLL AI ,RTLLSQG YN VN AVTLDHVTP 

LHEACLGDHVACARTLLEAGANVNAITIDGV 

TPLFNACSQGSPSCAELLLEYGAQAQLESCLP 

SPTHEGASKGHHECLDILISWGIDVDQEIPHSG 

TPLYVACMAQQFHCIWNLIYAGAGVRKGKY 

WDTPLPGAGHQSTQKLE*LFAMVEIWQ 


663 


2013 


A 


5066 


951 


580 


VRNS* SFAHCAS VYKHH YMDGQTPCLF VSSK 
ADLPEGV AVSGPSP AEFCRKHRLPAPVPFSCA 
GPAEPSTTIFTQLATMAAFPHLVHAELHPSSF 
WLRGLLGVVGAAVAAVLSFSLYRVLVKSQ 


664 


2014 


A 


5071 


550 


1 


LSFIEVLSMEQVNKTVYREFVVLGFSSLARLQ 

QLLFVIFLLLYLFTLGTNAIIISTIVLDRALHTP 

MYFFLA1LSCSEICYTFVIVPKMLVDLLSQKK 

T1SFLGCA1QMFSFLFFGSSHSFLLAAMGYDR 

YMAICNPLRYSVLMGHGVCMGLMAAAWAC 

GFTVSLVTTSLVFHLPFHSSNQHE 


665 


2015 


A 


5074 


496 


692 


QQYHNTGSAGHHAHCQVGHSPHVHYPSGCG 
PL*IQRGLPSFNSLEGHSLKDSGHEESVQLDSE 
HDVQRSLYCDTAVNDVLNTSVTSMGSQMPD 
HDQNEGFHCREECRiLGHSDRCWMrKNrMrl 
RSKSPEHVRNI1ALSIEATAADVEAYDDCGPT 
KRTFATFGKDVSDHPAEERPTLKGKRTVDVT 
ICSPKVNSVIREAGNGCEAISPVTSPLHLKSSL 
PTKPSVSYEIVDPGITARRC 


666 


2016 


A 


5080 


408 


24S 


IMLLSTSS*VYFQSSTKDSHFFLFDFQKTGPPL 
VGPKAQLSGLQLQPCLYKRR 


667 


2017 


A 


5081 


129 


247 


DLTNSHFFLFDFQKTGPPLGGPKAQFSSLQLQ 
PCVY*RR 


668 


2018 


A 


5086 


852 


233 


NIKSNDRWVQIKTAYKYFF*KNGDNYNWVF 

RALPTTFADIENLKYLLFTRDASQPFYLGHTV 

IFGDLE Y VTVEGG IVL SRELMKRLNRLLDNSE 

TCADQSVIWKLSEDKQLAICLKYAGVHAENA 

EDYEGRDVFNTICPIAQLIEEALSNNPQQWEG 

CCSDMA1TFNGLTPQKMEVMMYGLYRLRAF 

GHYFNDTLVFLPPVGSEND 


669 


2019 


A 


5101 


1 


329 


PGRPTRPPLLTLLAHVSPEPAGPSCDSLAQPG 
ASGV*VQHDSHPPLLCGSQCLSEPVPGSHGPP 
RGCQHE AAPCPRGPG SDGLHHA S A AC ASLPP 
SPILPVLLPELGPL 


670 


2020 


A 


5102 


3 


| 547 


DAWGNRCAVGAAPRLIHLHLCCTPADPSRKP 
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SEQ ID 

MO- nf 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

peptide 
seq- 
uence 


Met 
hnH 

I1UU 


SEQ 
TD MO- 
in 

USSN 
09/496 
914 


Predicted 
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nucleotide 

location 

correspondi 

ng to first 

nminft arid 

residue of 
peptide 

Olsl] UvllvV 
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nucleotide 
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to last amino 
acid residue 

of* nfrvtirlp 

sequence 


Amino acid sequence (A=AJanine C=Cysteine, 
D=Aspartic Acid, £=01^811110 Acid, j 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine } L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine 5 S=Serine, 
i — i iircuiiiiic, v— vd.nnCj w — i rypit/priaii, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 

liULlCUUIJC lllowl LIU11 














DEL*NMNGRVDYLV-rEEEliNlLTRGPSGLGFNT" 
VGGTDQQYVSNDSGIYVSRIK£NGAAALDGR 
1 OFGDKIT SVMnODT KNT T T40HA VDI FRIsJA 

GYAVSLRVQHRLQVQNGPIGHRGEGDPSGIPI 
FMVLVPVFALTMVAAWAFMRYROOL 

X 1»A V A— » t A V 1 r\X*t 1 I VI T TV iAJ lVJ|i\. 1 l\.V<f V^J_ i 


671 


2021 


A 


5105 


672 


400 


RDGREELCLQQEPTLPSRJCSSAPLLYFLFICPF 
VLLLLLLI SLLCL Y WKARKLSTLRSNTRKEKA 
LWVDLK£AGGVTTNRMED*EEDECN 


672 


2022 


A 


5148 


72 


314 


IIYFSYMFLKITELLNDVERLKQALNGLSQLT 
YTSGNPTKRQSQLIDTLQFJQVKSLEQQLAVS 
NQAHGALQEYVLAPCS 


673 


2023 


A 


5152 


210 


335 


REILCSRIGRLNIV*MSLFPNLTCRLNAIP1KIPA 
NHFVEVT 


674 


2024 


A 


5153 


3 


2953 


LTEDQPFD1LQKSLQEANITEQTLAEEAYLDA 

SIGSSQQFAQAQLHPSSSASFTQASNVSNYSG 

QTLQPIGVTHVPVGASFASNTVGVQHGFMQH 

VGISVPSQHLSNSSQISGSGQIQLIGSFGNHPS 

MMTINNLDGSQIILKGSGQQAPSNVSGGLLV 

HRQTPNGNSLFGNSSSSPVAQPVTVPFNSTNF 

QTSLPVHN1IIQRGLAPNSNKVPINIQPKPIQM 

GQQNTYNVMNI.GIQQHHVQQGISFASASSPQ 

GSWGPHMSVN1VNQQNTRKPVTSQAVSSTG 

GSIVIHSPMGQPHAPQSQFLIPTSLSVSSNSVH 

HVQTINGQLLQTQPSQLISGQVASEHVMLNR 

NSSNMLRTNQPYTGPMLNNQNTAVHLVSGQ 

TFAASGSPVIANHASPQLVGGQMPLQQASPT 

VLHLSPGQSSVSQGRPGFATMPSVTSMSGPSR 

FPAVSSASTAHPSLGSAVQSGSSGSNFTGDQL 

TQPNRTPVPVSVSHRLPVSSSKSTSTFSNTPGT 

GTQQQFFCQAQKKCLNQTSPISAPKTTDGLR 

QAQIPGLLSTTLPGQDSGSKVISASLGTAQPQ 

QEKWGSSPGHPAVQVESHSGGQKRPAAKQ 

LTKGAFILQQLQRDQAHTVTPDKSHFRSLSD 

AVQRLLSYHVCQQSMPTEEDLRKVDNEFETY 

ATQLLKRTQAJVILNKYRCLLLEDAK'IRINPPAE 

MVMIDRMFNQEERASLSRDKRLALVDPEGFQ 

AJJrttir i^lJK-AAJ-jLb 1 Qr GRoDQHubKASSS 

LQPPAKAQGRDRAICTGVTEPMNHDQFHLVP 

NHIWSAEGNISKKTECLGRALKFDKVGLVQ 

YQSTSEEKASRREPLKASQCSPGPEGHRKTSS 

RSDHGTESKLSS1LADSHLEMTCNNSFQDKSL 

RNSPKNEVLHTDIMKGSGEPQPDLQLTKSLET 

TFKNILELKKAGRQPQSDPTVSGSVELDFPNF 

SPMASQENCLEKFIPDHSEGVVETDSILEAAV 

NSILEC 


675 


2025 


A 


5154 


599 


1880 


LKKMEPFSCDTFVALPPATVDNRIIFGKNSDR 

LYDEVQEVVYFPAVVHDNLGERLKCTYIE1D 

QVPETYAVVLSRPAWLWGAEMGANEUGVCI 

GNEAVWGREEVCDEEALLGMDLVRLGLERA 

DTAEKALNVIVDLLEKYGQGGNCTEGRMVF 

SYHNSFLIADRNFAWIT FTAfiKYWA AFKY/OF 

GVRMSNQLSITTKIAR£HPDMRNYAJCRKGW 

WDGKKEFDFAAAYSYLDTAKMMTSSGRYCE 

GYT<XLNKHKGNITFETMME1LRDKPSGINME 

GEFLTTASMVFILPQDSSLPCIHFFTGTPDPER 

S VFKPFIFVPHIS QLLDTS SPTFELEDL VKKK S 

OTKPDRRHPLYQKHQQAXEVVNNINEEKAKI 

MLDNNIRKLEKELFREMES1LQNKHLDVEKIV 

NLFPQCTKDEIQIYQSNLSVKVSS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid. E=Glutamic Acid, 
^Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine 3 K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W^Tryptophan, 
Y=Tyrosinc, X=Unknown 5 *=Stop codon, 
/=possible nucleotide deletion, \=possibIe 
nucleotide insertion 


676 


2026 


A 


5155 


2 


306 


FFFLRRSLALSPRPDCULQWRNLGSLQAPPPG 
FTPFSCLSLPSS WDYRRPPPRPANFLYF* *RRG 
FTLL ARM V bib* PHDr r Ab Ab yb Abi luv orixvA 
RPT 


677 


2027 


A 


5167 


97 


740 


FFHSVDLLALEQSKTFYKPDWFDIVESEVKCC 

KEAVCV1DMSSFTEFEITSTGDQALEVLQYLF 

SNDLDVPVGHIVHTGMLNbuuuYblNJJUbiAK 

LNKRSFFMISPTDQQVHCWAWLKKI-IMPKDS 

NLLLEDVTWK YTALNLIGPRAVDVLSELS YA 

PMTPDHFPS LFCKEMS VG Y ANGIRVMSMTHT 

GEPGFMLYIPIEYRWGFTMLSTLVSNS 


678 


2028 


A 


5183 


1919 


2018 


PALCRLRDDMTVC V ADFGLbKKl Y buD Y Y Ky 
GRI AKMP VK WI AIE SL ADRV YTSKSD V W AFG 
VTM WEI ATRGMTP YPG VQNHEM YD YLLHG 
HRLKQPEDCLDELCKI* *SPQSP 


679 


2029 


A 


5190 


39 


499 


RESQVKHFKMRKIDLCLSSEGSEVILATSSDE 
KHPPEN1IDGNPETFW1 1 luMrFQJbMlL-rJ-nsJ-i 
VRIERLV1QSYFVQTLKIEKSTSKEPVDFEQWI 
EKDLV1 1TEGQLQNEEI V AI IDG SAT YLRFUVS 
AFDHFASVHSVSAEG1 VVbNLbb 


680 


2030 


A 


5204 


541 


92 


E1LAVLKI.ACGDISLNALALMVATAVLTLAPL 
LLICLSYLFILSAILRVPSAAGRCKAFSTCSAH 
RTVVV V FYGTISFMYr KJ^KArUJrJN v UK. i v/vl. 
FYGVVTPSLNPIIYSLRNAEVKAAVLTLLRGG 
LLSRKASHCYCCPLPLSAGIG 


681 


2031 


A 


5207 


10 


247 


VPDNGDVTKLPVCSTLVEETSLTVSEAMEQSI 
KNESPLPGTLAHTCNTb 1 LUUKUKw i l UKJir 
DTSMANMVKPCLYRK 


682 


2032 


A 


5210 


2 


231 


FFFETESYSITQAGVQWPNLSSLKTLPPGFK*F 
SCLSLPSSWDYRCLPPCrANruirbKiNuvLJ u 
WPGWSRTPDLS 


683 


2033 


A 


5218 


85 


402 


CPSVSGLIICSDLRRHNINIGli N VU Vi<LA v biN lr 
M1ILLRSMYRJNVKPYFFI*LFFSRWC*SVIIG 
YARCYTFLIF*LFL* IPADSPTDQEPKTVMLSK 
QSESAI 


684 


2034 


A 


5220 


1 


194 


NLMKEMQNLNSENHKTWEEYKDTK* IMS YF 
YG* ALNVIKM AVLPJvLM 1 Kr bA 1 L v isaryiiu 
TDS 


685 


2035 


A 


5228 


260 


440 


LHSQDGNSDPRKPQGEMSAHAFPVQTCGEED 
QKKTPQVPINFTELSKCS* S*KIMSGERE 


686 


2036 


A 


5239 


79 


508 


GGEAAARAAKLSSPRPHRVGRRERGVGGMS 
AFSEAALEKIvLSELSNSQQb VQ 1 EbJ_ WLitu-LK 
KHSRPIVIV^ERELRKAKPNRKLTFLYLAND 
VIQNSKRKGPEFTKDFAPVIVEAFKHVSSETD 
ES CIGCHLGRVL SI WEERS 


687 


2037 


A 


5244 


1 


428 


" MAAVVAATALKGRGARNARVLRGILAGATA 
NKASHNRTRALQSHSSPEGKEEPEPLSPELEYI 
PRKRGKNPMKAVGLAWA1GFPCGILLFILTKR 
EVDKDRVKQMKARQNMRLSNTGEYESQRFR 
ASSQSAPSPDVGSGVQT 


688 


2038 


A 


5249 


1 




LQQTEDKSLLNQGSSSEEVAGSSQKMGQPGP 

SGDSDI , ATAEHRLSLRRQNYLSEKQFFAEEW 

QRK1QVLADQKEGVSGCVTPTESLASLCTTQS 

EITDLSSASCLRGFMPEKLQIVKPLEGSQTLY 

HWQQLAQPNLGTILDPRPGVITKGFTQLPGD 

AIYHISDLEEDEEEGITFQVQQPLEVEEKLSTS 

KPVTGIFLPPITSAGGPVTVATANPGKCLSCT 

NSTFTFTTCRTEHPSDITQVTPSSGFPSLSCGSS 

GSSSSNTAVNSPALAYRJLSIGESri-NRRDSTTT 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 



SEQ ID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Giutamic Acid, 
F=Phenylalaninc a G=Glycine 5 H=Histidine, 
I=Iso!eucine. K==Lysine, L=Leucine, 
M^Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=l Yyptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 



689 



2039 



5254 



2621 



FS STMSLAKLLQERG I S A K V YHSPI SENPLQPL 

PKSLATPSTPPNSPSHSPCPSPLPFEPRVHLSEN 

FLASRPAETFLQEMYGLRPSRNPPDVGQLKM 

NLVDRLKRLGIARVVKNPGAQENGRCQEAEJ 

GPQKPDSAVYLNSGSSLLGGLRRNQSLPVIM 

GSFAAPVCTSSPKMGVLICED 



690 



2040 



5261 



304 



LSLFGSRALGRSGARAMAICAKKVGARRKAS 
GAPAGARGGPAKANSNPFEVKVNRQKFQILG 
RXTRHDVGLPGVSRARALRIvRTQTLLKEYKE 
RDKSNVFRDKRFGEYNSNMSPEEKN4MKRFA 
LEQQRHHEKKSIYNLNEDEELTHYGQSLADIE 
KHNDIVDSDSDAEDRGTLSGELTAAHFGGGG 
G1XHKKTQQEGEEREKPKSRKELIEELIAKSK 
QEKRERQAQRED ALELTEKLDQD WKEIQTLL 
SHKTPKSENRBKKEKPKPDAYDMMVRELGF 
EMKAQPSNRMKTEAELAKEEQEHLRKLEAE 
RLRRMLGKDEDEN VKKPKHM S ADDLNDGF V 
LDKDDRRLLSYKDGKMNVEEDVQEEQSKEA 
SDPESNEEEGDSSGGEDTEESDSPDSHLDLES 
NVESEEENEKPAKEQRQTPGKGLISGKERAG 
KATRDELPYTFAAPESYEELRSLLLGRSMEEQ 
LLVVERIQKCNHPSLAEGNKAKLEKLFGFLLE 
YVGDLATDDPPDLTVIDKLVVHLYHLCQMFP 
ESASDAIKFVLRDAMHEMEEMIETKGRAALP 
GLDVUYLKITGLLFPTSDFWHPVVTPALVCL 
SQLLTKCPILSLQDVVKGLFVCCLFLEYVALS 
QRFIPELINFLLGILYIATPNKASQGSTLVHPFR 
ALGKNSELLVVSAREDVATWQQSSLSLRWA 
SRLRAPTSTEANHIRLSCLAVGLALLKRCVLM 
YGSLPSFHAIMGPLRALLTDHLADCSHPQELQ 
ELCQSTLTEMESQKQLCRPLTCEKSKPVPLKL 
FTPRLVKVLEFGRKQGSSKEEQERKRJLIHKHK 
RE FKG A VREI RKDNQFLARM QLSE1MERD AE 
RKJUCVKQLFNSLATQEGEWI<ALKRKKFKJC 



FFFFVFLVETGFHHVGQAGLELLTSGDPPTW 
ASQSAGITGVSHCSWPVIYVLSTLLHAVRNVL 
FKRTFPLKSSSFLSYDKE1FPIUVLKFYLVTLT 
SFVK 



691 



2041 



692 



2042 



693 



2043 



5270 



158 



5282 



56 



1268 



NCHTTHCTANWVHLPGTPPGWKIDGPAAAL 
EVLS SFFFFFLKFS YKPQNI V 



5301 



362 



GMEPVGCCGECRGSSVDPRSTFVLSNLAEVV 

ERVLTFLPAKALLRVACVCRLWRECVRJWLR 

THRS VT WIS AG LAE AGHLEGHCL VR W AEEL 

ENVRJLPHTVLYMADSETFISLEECRGHKRAR 

KRTSMETALALEKLFPKQCQ\0,GIVTPGIVVT 

PMGSGSNRPQEIEIGESGFALLFPQIEGIKIQPF 

HFIKDPKNLTLERHQLTEVGLLDNPELRVVLV 

FGYNCCKVGASNYLQQVVSTFSDMNULAGG 

QVDNLSSLTSEKNPLDIDASGWGLSFSGIIRI 

QSATVLLNEDVSDEKTAEAAMQRLKAANIPE 

HNTIGFMFACVGRGFQYYRAKGNVEADAFR 

KFFPSVPLFGFFGNGEJGCDRIVTGNFJLRXCN 

EYKDDDLFHSYTTIMALIHLGSSK 



507 



EEIKERPGPGLVIYWYGF1QELDCNRERGILLK 
ACFPTNIVTLCHSIA 



694 



2044 



695 



2045 



5310 



204 



RVLTAINHTLKENLRKFYKGKKDKPLDLRPK 
KTRAMRRRLNMHEENLKTKKQHRKERLYPL 
RKYAAKA 



5315 i 125 



1596 



ETRSTAVKSEVQVCISLLLCLEDRTMP1CKAKP 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, b==ulutamic /\cia, 
F=Phenylalanine, G=Glycine, H==Histidine, 
I=Isoleueine, K=Lysine, L=Leucine 5 
M=Methionine ; N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosinc, X=Unknown, *=Stop codon. 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














TGSGKEEGPAPCKQMKLEAAGGPSALNFDSP 

SSLFESLISPIKTETFFKEFWEQKPLLIQRDDPA 

LATYYGSLFKLTDLKSLCSRGMYYGRDVNV 

CRCVNGKKXVLNKDGKAHFLQLRKDFDQKR 

ATIQFHQPQRFKDELWRIQEKLECYFGSLVGS 

NVYITPAGSQGLPPHYDDVEVFILQLEGEICH 

WRLYHPT VPLARE YS VEAbbKI UKr v tver ML 

KPGDLLYFPRGTIHQADTPAGLAHSTHVTIST 

YQNNSWGDFLLDTISGLVFDTAKEDVELRTG 

IPRQLLLQVESTTVATRRLSGFLRTLADRLEG 

TKEIXSSDMKKDFIMHRLPPYSAGDGAELSTP 

GGKLPRLDSWRLQFKDH1VLTVLPDQDQSD 

ETQEKMVYIYHSLKNSRETHMMGNEEETEFH 

GLRFPLSHLD ALKQIWN SP AI S VKDLKLTTDE 

EKESLVLSLWTECLIQVV 


696 


2046 


A 


5318 


1476 


742 


LMKXYLEAAELGEISDIHTKLLRLSSSQGTIET 
SLQDIDSRLSPGGSLADAWAHQEGTHPKDRN 
VEKEQVLLNCMTEl Y Y Qr RRL)KAiiKKlv/\ i jn 
EEQIHKFDKQKLYYHATKAMTHFTDECVKK 
YEAFLNKSEEWIRKMLHLRKQLLSLTNQCFDI 
EEE VSKYQEYTNELQETLPQKMFTAS SGIKHT 
MTPIYPSSNTLVEMTLGMKKLKEEMEGVVKE 
LAENNHILESGGSL 1 MDUOLKiN V U<^b 


697 


2047 


A 


5320 


244 


478 


LDYNFFLFEMTFGLVSQAGVQWHDLGSLQPP 
PPGFKQFSCLSLPSSWDYRHLPPHLANFSREG 
VSPSWPGWSRTPDFR 


698 


2048 


A 


5324 


266 


714 


LPIRKSLRSVRSGFPTSQSPITRNLDGTASGSC 
LAKTVTGSLFRINVGLRIjLV AUOIIu/u^llt i y 
VGGLLMAFQKYSGETVQERKQKDRICALHEL 
KEEEWKGRLQVTEHLPEKIESSLQEDEPENDA 
KXIEALLNLPRNPS VIDKQDKD 


699 


2049 


A 


5334 


699 


277 


RPHGHLVCISSSAGLSGVNGLADYCASKFAA 
FGFAESVFVETFVQKQK.GIKTTIVCPFFIKTGM 
FEGCTTGCPSLLPILEPKYAVEK1VEAILQEKM 
YXYMPKLLYFMMFLKbr LrLK 1 ULLIAU i lui 
LHAMDGFADQKK 


700 


2050 


A 


5344 


3 


614 


PTAEEMSSLTPESSPELABCRSWFGNFISLDKEE 

QBFLVLKDKPLSS1KADIVHAFLSIPSLSHSVLS 

QTSFRAEYKASGGPSVFQKPVRFQVDISSSEG 

PEPSPRRDGSGGGGIYSVTFTLISGPSRRFKRV 

VETIQAQLLSTHDQPSVQALADEKNGAQTRP 

AGAPPRSLQPPPGRPDPELSSSPRRGPPKDKK 

LLATNGTPL 


701 
702 


2051 
2052 


A 
A 


5346 
5356 


3 

1 2502 


13S3 
1540 


HASVLFCRVMAASKTQGAVARMQEDRDGSC 
STVGGVGYGDSKDCILEPLSLPESPGGTTTLE 
GSPSVPCIFCEEHFPVAEQDKLLKHMIIEHKIV 
IADVKLVADFQRYILYWRKRFTEQPITDFCSV 
ERTNSTAPFEEQENYFLLCDVLPEDRILREELQ 
KQRLRE1LEQQQQERNDTNFHGVCMFCNEEF 
LGNRS\nLNHMAREHAFNIGLPDNIVNCNEFL 
CTLQKKLDNLQCLYCEKTFRDKNTLKDHMR 
KKQHRKINPKNREYDRFYVINYLELGKSWEE 
VQLEDDRELLDHQEDDWSDWEEHPASAVCL 
FCEKQAETIEKLYVHMEDAHEFDLLK1KSELG 
LNFYQQVKLVNFIRRQVHQCRCYGCHVKFKS 
BCADLRTHMEETKHTSLLPDRKTWDQLEYYFP 
TYENDTLLWTLSDSESDLTAQEQNENVP11SE 
DTSKLYALKQSSILNQLLL 
- MAAATRGCRPWGSLLGLLGL VS AAAAAWD 
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NO: of 
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seq- 
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I SEQID 
NO: of 
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seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

IJSSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cystcine, 
D=Aspartic Acid. E— Glutamic Acid, 
F^Phenylaianine, G=Glycine, H=Histidine, 
f=Iso leucine, K=Lysine, L=Leucine, 
M^Methionine, N=Asparagine_, P=Proline ; 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine s V= Valine, W=Tryptophan ; 
Y-Tyrosine, X=Unkno\vn : *=Stop codon, 
/possible nucleotide deletion, \=possible 
nucleotide insertion 










* 




LASLRCTLGAFCECDFRPDLPGLECDLAQHL 

A J~\. TIT A T* w A T \ TX TTS A t TS" A W^X T 1*1 T"\ T1 A PlT^fV l"1 T T /I OF 

AGQHLAJCALVvkALKAPVRDPAPTKPL 

HGWTGTGKSYVSSLLAHYLFQGGLRSPRVH 

HFSPVLHFPHPSHIERYKICDLKSWVQGNLTA 

CGRSLFLFDEMDKMPPGLMEVLRPFLGSSWV 

VYGTN YRKAIFIF1SNTGGEQINQ VALEA WRS 

RI\DREEILLQELEPVISRAVLDNP11HGFSNSG1 

MEERLLDA V VPFLPLQRI-n fVRHC VLNELAQL 

GLEPRDEWQAVLDSTTFFPEDEQLFSSNGCK 

TVASRIAFFL 


703 


2053 


A 


5380 


278 


657 


LFLQKLRMKTEEEARTHTEIEMFLRKJEQQKLL 

EERLEFWMEKYDKDTEMKQNELNALKATKA 

SDLAHLQDLAKMIREYEQVIIEDRIEICERSKK 

KVKQDLLELKSVI1CLQAWWRGTMIRREIGGF 

KM 


704 


2054 


A 


5381 


1 


1003 


FRGRAVKMAAVVEVEVGGGAAGERELDEV 

DMSDLSPEEQWRWHARMHAKHRGHEAMH 

AEMVLILIATLVVAQLLLVQWKQRHPRSYN 

MVTLFQMWVVPLYFTVKLHWWRFLV1WILF 

SAVTAFVTFRATRKPLVQTTPRLVYKWFLLIY 

K1S Y ATGI VG YMAVMFTLFGLNLLFK1 KPED A 

MDFGISLLFYGLYYGVLERDFAEMCADYMA 

STIGFYSESGMPTKHLSDSVCAVCGQQIFVDV 

SEEGIIENTYRLSCNHVFHEFCIRGWCIVGKK 

QTCPYCKEKVDLKRMFSNPWERPHVMYGQL 

t r\i \ rr i*\ *\ rt XT a wr /"*\ r%x ftio t tt t r\ tx rit s~\ i t~» 

LDWLRYLVAWQPVIIGVVQGINY1LGLE 


705 


2055 


A 


5396 


3 


675 


IYDRDPLQLATRAGQPLDINMAGEPKPYRPKP 

GNKRPLSALYRLESKEPFLSVGGYVFDYDYY 

RDDFYNRLFDYHGRVPPPPRAV1PLKRPRVA 

VTTTRRGKGVFSMKGGSRSTASGSTGSKLKS 

DLLQ I lKJsJbL 1 KIDSVLGRLDKIEKQQK 

AEAEAQKKLLEESLVLIQEECVSEIADH STEEP 

AEGGPDADGEEMTDGIEEAFDEDGGHELFLQ 

TTX 


706 


2056 


A 


5410 


2 


98 


GRVGLNLEGRGCSEPKWRHCTPTWATEQDSI 
S 


707 


2057 


A 


5415 


6 


287 


PFKLTPSFLSHAFSSGQERKVFIELNHIKKCNT 
VkGVFVLEEFGNYTILLLGLDSHGSNSNLGAP 
EEGLGAGRKRTSVEKSGGAGVTRKKRDP 


708 


2058 


A 


5423 


3 


291 


SSSNPLGSPSTLWKLCSFVLHNKSCCCSFFGS 

TPTLRAITLTVRVCGFIPEVSKTTNPLGRTNNS 

GCTIFKTVTLTARSTASLLKSVRPRTHQKE 


709 


2059 


A 


5424 


679 


347 


RJRHEEKRGSRGRGRRTSEEDTPKXKKHKGG 
SEFTDT1LSVHPSDVLDMPVDPNEPTYCLCHQ 
VSYGEMIGCDNPDCPIEWFHFACVDLTTKPK 
GKWFCPRCVQEKRKKK 


710 | 


2060 


A 


5442 


1073 


559 


QESLKKK1QPKLSLTLSSSVSRGNVSTPPRHSS 
GSLTPPVTPPITPSSSFRSSTPTGSEYDEEEVDY 
EESDSDESWTIESAISSEAILSSMCMNGGEEK 
PFACPVPGCKKRYKNVNG1KYHAKNGHRTQI 

i\ V iNJSJr rivv^i\.\_,vjJVO 1 1\ 1 /W^vJl^lvi T.1 1 i 1INI I ill v 

SAEIIRKMQQ 


711 


2061 


A 


5449 


1 


319 


GDSLCWQYNKYREERVILFLKMASGHAFQP 
DLVKRIRDAIRMGLSARHVPSL1LETKGIPYTL 
NGKKVEVAVKQIIAGKAVEQGGAFSNPETLD 
LYRDIPELQGF 


712 


2062 


A 


5499 


91 


749 • 


RPTPGHGDFWMQPLTKDAGMSLSSVTLASAL 
QVRGEALSEEEIWSLLFLAAEQLLEDLRNDSS 
DYVVCPWSALLSAAGSLSFQGRVSHIEAAPF 
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Predicted 
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nucleotide 
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ng to first 
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Predicted end 
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location 
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to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D-Aspartic Acid, E-Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
l=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamme, R=Arginine, S=Serine, 
T=Threonine 5 V-Valine, W=Tryptophan 3 
Y=Tyrosine, X=Unknown s *^Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














KAPELLQGQSEDEQPDASQMH v y SLUM l JL y 
WSAGFHVPPHQPLQLCEPLHSILLTMCEDQPH 
RRCTLQSVLEACRVHEKEVSVYPAPAGLHIR 
RLVGL VLGTI SE VSREPCFSb bbC W ^ C V AlisJ 


713 


2063 


A 


5506 


22 


478 


VEELIL V SRLDPHLHTPM YFFLAHLSFLDL SFT 
TSSIPQLLYNLNGCDKTISYMGCAIQLFLFLGL 
GGVECLLLAVMAYDRCVAICKPLHYMNTMN 
PRLCRGLVSVTWGCGVANSLAMSPVTLRLPR 
CGHHEVDHFLCEMPALIRMACISTV 


714 


2064 


A 


5514 


25 


220 


"AIRPYWCENNIIGIGKLSTADGKAFADPEVLR 
RLTSSVSCALDEAAAALTRMRAESTANAGQS 
DK 


715 


2065 


A 


5526 


3 


810 


KVTAPRRPQRYSSGHGSDNSSVLSGELPPAM 

GRTALFHHSGGSSGYESLRRDSEATGSASSAP 

DSMSESGAASPGARTRSLKSPKKRATGLQRR 

RLIPAPLPDTTALGRKPSLPGQWVDLPPPLAG 

SLKEPFEIKVYEIDDVERLQRPRPTPREAPTQG 

LACVSTRLRLAERRQQRLREVQAKHKHLCEE 

LAETQGRLMLEPGRWLEQFEVDPELEPESAE 

YLAALERATAALEQCVNLCKAHVMMVTCFD 

1SVAASAAIPGPQEVDV 


716 


2066 


A 


5529 


458 


790 


SPGYGENKFTVTSXNIAVPLCEMNKIYSYYSD 
SSSSERTMDLVLEMCNTNSIHWCGISGRQLG 
KLHPS SSLCLALTLLS S VQGLQSISGLRLTDTF 
LKRTYEYDDIAQVCV 


717 


2067 


A 


5531 


3 


460 


NSEDLLKYFNPESWQEDLDNMYLDTPRYRG 
RS YHDRKSK VDLDRLNDD AKRYSCTPRNYS 
VNIREELKLANVVFFPRCLLVQRCGGNCGCG 
TVNWRSCTCNSGKTVKKYHEVLQFEPGHIKR 
RGRAKTMALVDIQLDHHERCDCICSSRPPR 


718 


206S 


A 


5586 


311 


88 


AVLKNMAPMTALGLLDLHILNLILFLSAGEDF 

TSWSE1MMYILLVFLTLWLLIEMIYCYRKVS 

KAEEAAQENA 


719 


2069 


A 


5598 


1 


330 


KNCANEAWQKILDRVLSRYDVRLRPNFGSM 
L ATN S TRGLNEDELMAHGQEKD S SSESED SC 
PPSPGCSFTEGFSFDLLNPDYVPKVDkWSRFL 
FPL AF GLFNI VAAERC 


720 


2070 


A 


5628 


798 


148 


" LPPAQIPEAWLLLANWVVLILVPLKDRLIDP 
LLLRCKLLPSALQKMALGMFFGFTSVIVAGV 
LEMERLHYIHHNETVSQQIGEVLYNAAPLSIW 
WQIPQYLLIGISEIFASIPGLEFAYSEAPRSMQG 
AIMGIFFCLSGVGSLLGSSLVALLSLPGGWLH 
CPKDFGNINNCRMDLYFFLLAGIQAVTALLF 
VWIAGRYERASQGPASHSRFSRDRG 


721 


2071 


A 


5632 


146 


536 


" MSALIVRKLRSAELTLFSELPTVLGANVNAA 
KJ^HET ALHH AAK VKNVDLIEML IEF G GNI Y A 
RDNRGKKPSDYT W S S S AP AKCFEYYEKTPLT 
LSQLCRVNLRKATGVRGLEKIAKLN1PPRLID 
YLSYN 


722 


2072 


A 


5638 


3 


3806 


CPSLDIRSEVAELRQLENCSWEGHLQILLMF 

T A TGFnFRGLSFPRLTO VTD YLLLFRVYGLES 

LRDLFPNLAVIRGTRLFLGYALVIFEMPHLRD 

VALPALGAVLRGAVRVEKNQELCHLSTIDW 

GLLQPAPGANHIVGNKLGEECADVCPGVLGA 

AGEPCAKTTFSGHTDYRCWTSSHCQRVCPCP 

HGMACT ARGECCHTECLG G CSQPEDPRAC V 

ACRHLYFQGACLWACPPGTYQYESWRCVTA 

ERCASLHSVPGRASTFGTHQGSCLAQCPSGFT 

RNSSSIFCHKCEGLCPKECKVGTKTIDS1QAA 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peplide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F=PhenyIalanine, G^Glycine, H^Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown. *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














QDLVGCTHVEGSLILNLRQGYNLEPQLQHSL 

GLVETITGFLK1KHSFALVSLGFFKNLICLIRGD 

AMVDGNYTLYVLDNQNLQQLGSWVAAGLTI 

PVGKIYFAFNPRLCLEHIYRLEEVTGTRGRQN 

KAEINPRTNGDRAACQTRTLRFVSNVTEADRI 

LLRWERYEPLEARDLLSFIVYYICESPFQNATE 

HVGPDACGTQSWNLLDVELPLSRTQEPGVTL 

ASLKP WTQ Y A VF VRAITLTTEEDSPHQG AQS 

PIVYLRTLPAAPTVPQDVISTSNSSSHLLVRW 

KPPTQRNGNLTYYLVLWQRLAEDGDLYLND 

YCHRGLRLPTS7STNDPRFDGEDGDPEAEMESD 

CCPCQHPPPGQVLPPLEAQEASFQKKFENFLH 

NAITIPISPWKVTSINKSPQRDSGRHRRAAGPL 

RLGGNSSDFEIQEDKVPRERAVLSGLRHFTEY 

RJDIHACNHAAFITVGCSAATFVFARTMPHRE 

ADGIPGKVAWEASSKNSVLLRWLEPPDPNGL 

ILKYEIKYRRLGEEATVLCVSRLRYAKFGGV 

HL ALLPPGN YS AR VRATSLAGNG SWTDSVAF 

YILGPEEEDAGGLHVLLTATPVGLTLLIVLAA 

LGFFYGKKRNRTLYASVNPEYFSASDMYVPD 

EWEVPREQISIIRELGQGSFGMVYEGLARGLE 

AGEESTPVALKTVNELASPRECIEFLKEASVM 

KAFKCHHVVRLLGVVSQGQPTLVIMELMTR 

GDLKSHLRSLRPEAENNPGLPQPALGEM1QM 

AGEIADGMAYLAANKFVHRDLAARNCMVSQ 

DFTVKIGDFGMTRDVYETDYYRKGGKGLLP 

VRWMAPESLKDGIFTTHSDVWSFGVVLWEIV 

TLAEQPYQGLSNEQVLKFVMDGGVLEELEGC 

PLQLQELMSRCWQPNPRLRPSFTHILDSIQEEL 

RPSFRLLSFYYSPECRGARGSLPTTDAEPDSSP 

TPRDCSPQNGGPGH 


723 


2073 


A 


5672 


1 


216 


LAWIDNILPEKEKKETDKKRKRKKGAHEDCD 

EEPQFPPPSVIKIPMESVQSDPQNGIHCIARKR 

SSSWSYSL 


724 


2074 


A 


5704 


4235 


940 


ARGRRSRJPVWAASWGGRGRPAARRRPRGLA 

ATMGFELDRFDGDVDPDLKCALCHKVLEDP 

LTTPCGHVFCAGCVLPWVVQEGSCPARCRGR 

LSAICELNI I VLPLKRLILKLDIKCAYATRGCGR 

VVKJLQQLPEHLERCDFAPARCRHAGCGQVLL 

RRDVEAHMRDACDARPVGRCQEGCGLPLTH 

GEQRAG G HCCARALRAHNGALQARLGALHK 

ALICKEALRAGKREKSLVAQLAAAQLELQM'r 

ALRYQKKFTEYSARLDSLSRCVAAPPGGKGE 

ETKSLTLVLIIRDSGSLGFNIIGGRPSVDNIIDG 

SSSEGIFVSKIVDSGPAAKEGGLQIHDRIIEVN 

GRDLSRATHDQAVEAFKTAKEPIVVQVLRRT 

PRTKMFTPPSESQLVDTGTQTDITFEHIMALT 

KMSSPSPPVLDPYLLPEEHPSAHEYYDPNDYI 

GDIHQEMDREELELEEVDLYRMNSQDKLGLT 

VCYRTDDEDDIGIYISE1DPNSIAAKDGRIREG 

DRIIQINGIEVQNREEAVALLTSEENKNFSLLI 

ARAELQi .DEG WMDDDRNDFLODLHMDMLE 

EQHHQAMQFTASVLQQICKHDEDGGTTDTAT 

ILSNQHEKDSGVGRTDESTRNDESSEQENNG 

DDATASSNPLAGQRKLTCSQDTLGSGDLPFS 

NESFISADCTDADYLGIPVDECERFRELLELK 

CQVKSATPYGLYYPSGPLDAGKSDPESVDKE 

LELLNF.ELRSTELECLSIVRAHKMQQLICEQYR 

ESWMLHNSGFRNYNTSIDVRRHELSDITELPE 

KSDKDSSSAYNTGESCRSTPLTLEISPDNSLRR 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, j 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G-=Glycine, H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M^Methionine, N=Asparagine, P^Proline, 
Q=Giutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possibIe 
nucleotide insertion 














AAEGISCPSSEGAVGTTEAYGPASKNLLSITE 

DPEVGTPTYSPSLKELDPNQPLESKERRASDCj 

SRSPTPSQKEGSAYLPSYHHSPYKHAHIPAHA 

QHYQSYMQLIQQKSAVEYAQSQMSLVSMCK 

DLSSPTPSEPRMEWKVKIRSDGTRYITKRPVR 

DRLLRERALKIREERSGMTTDDDAVSEMKM 

GRYWSKEERKQHLVKAKEQRRRREFMMQSR 

LDCLKJEQQAADDRXEMNILELSHKKMMKKR 

NI<CKIFDN\VMTIQELLTHGTKSPDGTRVYNSP 

LSVTTV 


725 


2075 


A 


5707 


3 


1770 


QISTEVSEAPVANDKPKTLWKVQKKAADLP 

DRDTWKGRFDFLMSCVGYA1GLGNVWRFPY 

LCGKNGGGAFLIPYFLTLIFAGVPLFLLECSLG 

QYTSIGGLGVWKLAPMFKGVGLAAAVLSFW 

LNIYYIVIISWAIYYLYNSFTTTLPWKQCDNP 

WNTDRCFSNY SM VNTTNMTS AV VEF WERN 

MHQMTDGLDKPGQTRWPLAITLAIAWILVYF 

CIWKGVGWTGKVVYFSATYPYIMLIILFFRGV 

TLPGAKEGILFYITPNFRKLSDSEVWLDAATQ 

IFFSYGLGLGSL1ALGSYNSFHNNVYRDSIIVC 

CINSCTSMFAGFVTFSIVGFMAHVTKRSIADV 

AASGPGLAFLAYPEAVTQLPISPLWAILFFSM 

LLMLGIDSQFCTVEGFITALVDEYPRLLRNRR 

ELFIAAVCIISYL1GLSN1TQGGIYVFKLFDYYS 

ASGMSLLFL VFFEC V SI S WF YG VNRF YDNIQE 

MVGSRPCIWWKLCWSFFTPIIVAGVFIFSAVQ 

MTPLTMGNYVFPKW GQGVG WLMALbbM VL 

IPGYMAYMFLTLKGSLKQRIQVMVQPSEDIV 

RPENGPEQPQAGSSTSKEAYJ 


726 


2076 


A 


5711 


156 


423 


PRRDPGRTPELRGSAPRKTGANMPVRRGHVA 

PQNTFLGTIIRKFEGQNICKFIIANARVQNCAII 

YCNDGFCEMTGFSRPDVMQKPCTCD 


727 


2077 


A 


5716 


3 


274 


HASEYFFKLCSFQVFLSFPLATIVIDVGLVVIP 
LVKSPNVHYVYVLLLVLSGLLFYIPLTHFKIRL 
AWFEKMTCYLQLLFNICLPDVSEE 


728 


2078 


A 


5737 


1899 


649 


IQASRASPYPRVKVDFALSCHEDLLAPISEPIE 

WKYHSPEEEISLGPACWLWDFLRRSQQAGFL 

LPLSGGVDSAATACLIYSMCCQVCEAVRSGN 

EEVLADVRTIVNQISYTPQDPRDLCGRILTTC 

YMASKNSSQETCTRARELAQQIGSHHISLNID 

PAVKAVMGIFSLVTGKSPLFAAHGGSSRENL 

ALQNVQARIRMVLAYLFAQLSLWSRGVHGG 

LL VLG S ANVDESLLG YLTK YDC S S ADINPIGG 

ISKTDLRAFVQFCIQRFQLPALQSILLAPATAE 

LEPLADGQVSQTDEEDMGMTYAELSVYGKL 

RKVAKMGPYSMFCKLLGMWRHICTPRQVAD 

K VKRFFSKY SMNRHKMTTLTP A YHAEN Y SPE 

DNRFDLRPFLYNTSWPWQFRCIENQVLQLER 

AEPQSLDGVD 


729 


2079 


A 


5741 


1 


5976 


PGCAARLSRARAPGPGAAGAGRKRLADPGPP 
pa«?pt?T RAPGSRPRT APCTRRAAOPAHARMA 
PRAAGGAPLSARAAAASPPPFQTPPRCPVPLL 
LLLLLGAARAGALEIQRRFPSPTPTNNFALDG 
AAGTVYLAAVNRLYQLSGANLSLEAEAAVG 
PVPDSPLCHAPQLPQASCEHPRRLTDNYNKIL 
QLDPGQGLVWCGSIYQGFCQLRRRGN1SAV 
AVRFPPAAPPAEPVTVFPSMLNVAANHPNAS 
TVGLVLPPAAGAGGSRLLVGATYTGYGSSFF 
PRNRSLEOHRFENTPEIAIRSLDTRGDLAKLFT 
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SEQ ID 
NO- nf 
nucl- 
eotide 
seq- 
uence 


SEQ ID 

NO nf 

peptide 
seq- 
uence 


Met 

hori 
uuu 


SEQ 

1JL/ JNU. 

in 

USSN 
09/496 
914 


Predicted 

n i^ci inn in r» 

uL/ginninij 
nucleotide 
location 
correspond] 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AJanine C=Cysteme, 
D^Aspartic Acid ; E=Glutamic Acid, 
F=PhcnyIalanine, G=GIycine, Il=IIistidine, 
Hlsoleucine, K=Lysine : L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine 7 S=Serine, 
T= Threonine.. V= Valine, \V=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














FDLNPSDDNILKIKQGAK^QHKLGFVSAFLHP 

SDPPPGAQSYAYLALNSEARAGDKESQARSL 

LARICLPHGAGGDAKKLTESYIQLGLQCAGG 

AGRGDLYSRLVSVFPARERLFAVFERPQGSPA 

ARAAPAALCAFRFADVRAAIRAARTACFVEP 

APDWAVLDSVVQGTGPACERKLNIQLQPEQ 

LDCGAAHLQHPLSILQPLKATPVFRAPGLTSV 

AVASVNNYTAVFLGfVNGRLLKrNLNESMQ 

WSRRVVTVAYGEPVHHVMQFDPADSGYLY 

LMTSHQMARVKVAACKVHSTCGDCVGAAD 

AYCGWCALETRCTLQQDCTNSSQQHFWTSA 

SEGPSRCPAMTVLPSETDVRQEYPGMILQISGS 

LPSLSGMEMACDYGNNIRTVARVPGPAFGHQ 

I A YCNLLPRDQFPPFPPNQDHVT VEMS VR VN 

GRNIVKANFTIYDCSRTAQVYPHTACTSCLSA 

QWPCFWCSQQHSCVSNQSRCEASPNPTSPQD 

CPRTLLSPLAPVPTGGSQNILVPLANTAFFQG 

AALECSFGLEEIFEAVWVNESVVRCDQVVLH 

TTRKSQVFPLSLQLKGRPARFLDSPEPMTVM 

V YNCAMG SPDCSQCLGREDLGHLCM WSDGC 

RLRGPLQPMAGTCPAPEUIAIEPLSGPLDGGT 

LLTIRGRNLGRRLSDVAHGVWIGGVACEPLP 

DRYTVSEEIVCVTGPAPGPLSGVVTVNASKE 

GKSRDRFSYVLPLVHSLEPTMGPKAGGTRJTI 

HGNDLHVGSELQVLVNDTDPCTELMRTDTS1 

ACTMPEGALPAPVPVCVRFERRGCVHGNLTF 

WYMQNPVITAISPRRSPVSGGRTITVAGERFH 

MVQNVSVIAVFDI1GREPTLCKVLNSTLITCPSP 

GALSNASAPVDFFINGRAYADEVAVAEELLD 

PEEAQRG SRFRLD YLPNPQFSTAKREK WIKH 

HPGEPLTLVIHVSTKGAGKEQDSLGLQSHEY 

RVKJGQVSCDIQIVSDRIIHCSVNESLGAAVGQ 

LPITIQVGNFT^QTIATLQLGGSETAIIVSIVICSV 

LLLLSVVALFVFCTKSRRAERYWQKTLLQME 

EMESQIREEIRKGFAELQTDMTDLTKELNRSQ 

GIPFLEYKHFVTRTFFPKCSSLYEERYVLPSQT 

LNSQGSSQAQETHPLLGEWKIPESCRPNMEE 

GISLFSSLLDNKHFLIVFVHALEQQKDFAVRD 

RCSLASLLTIALHGKJLEYYTS1MKELLVDLID 

ASAAKNPKLMLRRTESVVEKMLTNWMSICM 

YSCLRETVGEPFFLLLCAIKQQINKGSIDAITG 

KARYTLNEEWLLRENtEAKPRNLNVSFQGCG 

MDSLSVRAMDTDTI.TQVKEKILEAFCKNVPY 

SQWPRAEDVDLE WFASSTQS YILRDLDDTS V 

VEDGRKKLNTLAHYKIPEGASLAMSLIDKKD 

in i LOK VJsJJLrJJI JbK. Yr HL VU 1 DELAEPKKSJrl 

RQSHRKKVLPEIYLTRLLSTKGTLQKFLDDLF 

KAILSIREDKPPLAVKYFFDFLEEQAEKRGJSD 

PDTLHIWKTNSLPLRF WVNILKNPQF VFDI DK 

KLLYAKEIPEYRKIVQRYYKQIQDMTPLSEQE 
MNAJILAEESRKYQNEFNTNVAMAEIYKYAK 
RYRPQIMAALEANPTARRTQLQHKFEQVVAL 
MEDNJYECYSEA 


730 


2080 


A 


5744 


3 


292 


QPSPLFHSHLETLQLLRTAQLPEQVSWPWGQ 
VANGKGNQRNMGSPQPSLLAFERNLELQIMG 
LGYSLLMGKLRPRVAKDTLRVHRDSTPSPLT 
LKD 


731 


2081 


A 


5747 


1 


382 


FLKCMRKAFRSSKLLQVGYTPDGKDDYRWC 
FRVDEVNWTTWNTNVGIINEDPGNCEGVKRT 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of ! 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cystcine 5 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
l=Tsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline 3 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V— valine, w— iryptopnan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














LSFSLRSSRVSGRHWKNFALVPLLREASARD 

RQSAQPEEVYLRQFSGSLKPEDAEVFKSPAAS 

GEK 


732 


2082 


A 


5753 


198 


3 


AQAESSTVASPEATAGPLCTRIPNVPPPTPIRP 
PGKLQAQLPCPSPVRFTSARIPPASRPQTKS 


733 


2083 


A 


5754 


2 


2223 


AAGPPGLEAEGRAPESAGPGPGGDAAETPGL 

PPAHSGTLMMAFRDVTVQIANQNISVSSSTAL 

SVANCLGAQTVQAPAEPAAGKAEQGETSGR 

EAPEAPAVGREDASAEDSCAEAGASGAADG 

ATAPKTEEEEEEEETAEVGRGAEAEAGDLEQ 

LNRTSTSTKSAKSGSEASASASKDALQAMILS 

LPRYHCENPASCKSPTLSTDTLRKRLYRIGLN 

LFNINPDKGIQFLISRGFIPDTPIGVAHFLLQRK 

GLSRQM1GEFLGNSKKQFNRDVLDCVVDEM 

DFSSMELDEALRKFQAHIRVQGEAQKVERLIE 

AFSQRYCMCNPEVVQQFHNPDTIFILAFAIILL 

NTDMY SPNIKPDRKMMLEDFIRNLRGVDDG 

ADIPRELVVGIYERIQQKELKSNEDHVTYVTK 

VEKSIVGMKTVLSVPHRRLVCCSRLFEVTDV 

NKLQKQAAHQREVFLFNDLLVILKLCPKKKS 

SSTYTFCKSVGLLGMQFQLFENEYYSHGITLV 

TPLSGSEKKQVTHFCALGSDEMQKFVEDLKE 

SJAEVTELEQIRIEWELEKQQGTKTLSFKPCGA 

QGDPQSKQGSPTAKREAALRERPAESTVEVSI 

HNRLQTSQHNSGLGAERGAPVPPPDLQPSPPR 

QQTPPLPPPPPTPPGTLVQCQQIVKVIVLDKPC 

LARMEPLLSQALSCYTSSSSDSCGSTPLGGPG 

SPVKVTHQPPLPPPPPPYNHPHQFCPPGSLLH 

GHRYSSGSRSLV 


734 


2084 


A 


5788 


8 


362 


SSVMGDLVGQGLEEQIVARDENSWLIDGGTP 
IDDVMRVLDIDEFPQSGNYETIGGFMMFMLR 
KIPKRTDS VKFAG YKFE WDIDN YRIDQLL VT 
RIDSKATALSPKLPDAKDKEESVA 


735 


2085 


A 


5827 


1 


1257 


M VFSAVLTAFHTGTSNTTF V V YENTYMNITL 

PPPFQHPDLSPLLRYSFETMAPTGLSSLTVNST 

AVPTTPAAFKSLNLPLQITLSAIMIFILFYSFLG 

NLVVCLMVYQKAAMRSAINILLASLAFADM 

LLAVLNMPFALVTILTTRWIFGKFFCRVSAMF 

FWLFVIEGVAILLIISIDRFLHVQRQDKLNPYR 

AKVLIAVSWATSFCVAFPLAVGNPDLQIPSRA 

PQCVFGYTTNPGYQAYVILISLIarrlrrJLVLLr 

SFMGILNTLRHNALRJHSYPEGICLSQASKLGL 

MGLQRPFQMSIDMGFKTRAFTTILILFAVFIVC 

WAPFTTYSLVATFSKHFYYQHNFFEISTWLL 

WLCYLKSALNPLIYYWRIKKFHDACLDMMP 

KSFKFLPQLPGHTKRRIRPSAVYVCGEHRTVV 


736 


2086 


A 


5870 


3 


268 


FTRSDELARHYR I H I UhiKRhbiUPLCFKQr bKb 
DHLTKHARRHPTYHPDMIEYRGRRRTPRIDPP 
LTSEVESSASGSGPGPAPSFTTCL 


737 


2087 


A 


5871 


2 


521 


" LTWPQLFLETLPELLHMSRPAEDGPSPGALVR 
RSSSLGYISKAEEYFLLKSRSDLMFEKQSERH 
GLARRLTTARRPPAS SEQAQQELFNELKPAV 
DGANFIVN T HMRDQNNYNEEKDSWNRVART 
VDRLCLFVVTPVMWGTAWIFLQGVYNQPPP 
QPFPGDPYSYNVQDKRFI 


738 


2088 


A 


5881 


1 


1160 


LVVTAITAILAFPNEYTRMSTSELISELFNDCG 
LLDSSKLCDYENRFNTSKGGELPDRPAGVGV 
YSAMWQLALTLILKIVITIFTFGMKIPSGLFIPS 
MAVGAIAGRLi^GVGMEQLAYYHQEWTVFNS 
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T/US0 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

UCU)| J 111 11 

nucleotide 
location 
correspond! 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 

UUClCUllUC 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

Ls — ^-VoJJill 111-- ri.^IU, C — UllildllliW /\C1Q, 

F=Phenylalanine J G=-Glycine s H=Histidine, 
I=lsolcucinc, K=Lysine. L=Leucinc, 
M=Methionine, N=Asparaginc s P=Pro!ine. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, W=Tryptophan, 
Y=Tyrosine, X= Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














WCSOG ^DPITPOT YAMVCJAAAPI HPiVTRMT 

VSLVVIMFELTGGLEYIVPLK4AAAMTSKWVA 

DALGREGLYDAHIRLNGYPFLEAKEEFAHKTL 

AMDVMKPRRNDPLLTVLTQDSMTVEDVETII 

SETTYSGFPVWSRESQRLVGFVLRRDLIISIE 

NARKKQDGWSTSIIYFTEHSPPLPPYTPPTLK 

LRN1LDLSPFTVTDLTPMEIWDIFRKLGLRQC 

LVTHNGRLLGIITKKDVLKHIAQMANQDPDSI 

LFN 


739 


2089 


A 


5892 


2 


916 


TLQLAASVPFFAISL1SWWLPESARWLIINGKP 

KEEVASAKEPRSVLDLFCVPVLRWRSCAMLV 
VNFSLLISYYGLVFDLQSLGRJDIFLLQALFGA 
VDFLGRATTALLLSFLGRRTIQAGSQAMAGL 
AILANMLVPQDLQTLRVVFAVLGKGCFGISL 

TCI TTYk^AFT FPTPVl? M T A nr. n HTVP.Pl t~lA 

MMGPLILMSRQALPLLPPLLYGVISIASSLVVL 

FFLPETQGLPLPDTIQDLESQKSTAAQGNRQE 

AFTVESTSLLEIVALHGAL 


740 


2090 


A 


5900 


2 


426 


RPIKTLGIGFHFSVDGVHFLTQREVQNLWKE 
"Ml Hi nTA^i^HCVFwrkTPTiTunDVk'rci nr, 

wi-/i.LLjLJ i Arvr\.nvj 1 LV VUI I ill ivioix I IVJL, rLyU 

KCGCHFHEWKSKLSKEYNF1KMKRSRNHIM 
GRYFSNQSKLQQGTVTNFRSPYHVRGPINQV 
CSEILLSRMCANKRTM 


741 


2091 


A 


5910 


3 


412 


RMPESTLLUCENGYILEAPLPTIKQEEDDHDV 
VSTOIKDMCIKCFHFSSVKSKILRLIEIEKRER 

ORF1 k"FK r TRFFRI}>Jl< r T A A PMP.Pnr.Plf CFfiFF 

EEEKEEEEEEEEPLPEIFIPSTPSPrLCGFYSEPG 
KFWV 


742 


2092 


A 


5936 


1 


482 


MGCRLLCCVVFCLLQAGPLDTAVSQTPKYLV 

TQMGNDKS IKCEQNL GHDTM Y W YKQDSKK 

FLKIMFSYNNRELIINETVPNRFSPKSPDtLAHL 

NLHINSLELGDSAVYFCASSQDTALQSHCIPV 

HKPPGSARKLQGSVCTCTQGSSLHSLMASDG 

VPVC 


743 


2093 


A 


5938 


1 


1566 


MNSFFGTPAASWCLLESDVSSAPDKEAGRER 

RALSVQQRGGPAWSGSLEWSRQSAGDRRRL 

GLSRQTAKSSWSRSRDRTCCCRRAWWILVPA 

ADRARRERFIMNEKWDTNSSENWHPI\VNVN 

DTKFIHLYSDINITYVNYYLHQPQVAAIFIISYF 

LIFFLCMMGNTVVCFIVMRNKHMHTVTNLFI 

LNLA1SDLLVGIFCMP1TLLDN11AGWPFGNTM 

CiaSGLVQGJSVAASVFTLVAIAVDRFQCVVY 

VQEEKYYRVRLNSQNKTSPVYWCREDWPNQ 

FMRKIYTTVI FANTYT APT <?T TVIMYnRfOI^T F 

RAAVPHTGRKNQEQWHVVSRKKQKIIKMLLI 

VALLFILSWLPLWTLMMLSDYADLSPNELQII 

NIYIYPFAHWLAFGNSSVNPirYGFFNENFRRG 

FQEAFQLQLCQKJIAKPMEAYAJ^KAKSITVLIN 

TSNQLVQESTFQNPHGETLLYRKSAEKPQQE 

LVMEELKETTNSSEI 


744 


2094 


A 


5966 


149 


327 


SHVCVSHYAGSSGCPAGAGAGAVALGISAVA 
LYDYQGGRLGVARGAWYMEAPDIRQGDM 


745 


2095 


A 


5970 


413 


856 


GAPHTDWAWAPTPMSGLGSGRGRQGTLASS 
PLSLPLLLAGVTGILATELFDQMARPAACMV 
CGALMWIMLILVGLGFPFIMEALSHFLYVPFL 
GVCVCGAIYTGLFLPETKGKTFQEISKELHRL 
NFPRRAQGPTWRSLEVJQSTEL 
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5DOCID: <WO 01571B8A2_I_> 



WO 01/57188 



PCT/USOI/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C-Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine, K^Lysine, L=Leucine, 
M-Methionine, N=Asparagine 3 P=Proline f 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V= Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, V=possiblc 
nucleotide insertion 


746 


2096 


A 


5971 


3 


1343 


AQTARRIIGLELDTEGHRLFVAFSGCIVYLPLS 

RCARHGACQRSCLASQDPYCGWHSSRGCVDI 

RGSGGTDVDQAGNQESMEHGDCQDGATGSQ 

SGPGDSAYGVRRDLPPASASRSVPIPLLLASV 

AAAFALGASVSGLLVSCACRRAHRRRGKDIE 

TPGLPRPLSLRSLARLHGGGPEPPPPSICDGDA 

VQTPQLY 11 hLP Pr nCr V r r rc,L, AL- L-r 1 rt-b 1 rc, 

LPVKHLRAAGDP WE WNQNRN N AKEGPGRSR 

GGHAAGGPAPRVLVRPPPPGCPGQA\ r EVTTL 

EELLRYLHGPQPPRKGAEPPAPLTSRALPPEP 

APALLGGPSPRPHECASPLRLDVPPEGRCASA 

P ARPALSAPAPRLGVGGGRRLPFSGHRAPPAL 

LTRVPSGGPSRYSGGPGKHLLYLGRPEGYRG 

RALKRVDVEKPQLSLKPPLVGPSSRQAVPNG 

GRFNF 


747 


2097 


A 


5998 


2 


754 


DliASLPCSWNHRFDVETRHVFlOUrlbuQV 1 i 

LKLEQENCTLVTTFRGHTGGVTALCWDPVQ 

RVLFSGSSDHSVIMWDIGGRKGTAIELQGHN 

DRVQALSYAQHTRQLISCGGDGGIVVWNMD 

VERQETPEWLDSDSCQKCDQPFFWNFKQMW 

DSKKIGLRQHHCRKCGKAVCGKCSSKRSSIPL 

MGFEFEVRVCDSCIIEAITDEER-A.PT ATFHDSK 

HNnWHFDATRGWLLTSGTDKVIKLWDMT 

PVVS 


748 


2098 


A 


6001 


2 


747 


AMVFGGVVPYVPQYRD1RRTQNADGFSTYV 
CLVLLVANILR1LFWFGRRFESPLLWQSAIMIL 
TMLLMLKLCTEVRVANELNARRRSFTAADS 
KDEE VK V APRRS FLDFDPHHF WQ WS SF S D Y V 
QCVLAFTGVAGYI1 YEMDbALr vfci LurL/w 
LTEAMLGVPQLYRNHRHQSTEGMSIKMVLM 
WTSGDAFKTAYFLLKGAPLQFSVCGLLQVLV 
DLAILGQAYAFARHPQKPAPHAVHPTGTKAL 


749 


2099 


A 


6002 


2 


447 


GRPDRSELVRMHILEETFAEPSLQATQMKLK 

t-» a t»t a tvtm vn?l/T A /"YD DPDN /CI \7T7VMTT P\/T^Q { \ 

RARL ADDLNEKIAQKrOrM t, L V nlviN iL,r V Do o 
VKEAIIGVGKEDYPHTQGDFSFDEDSSDALSP 
DQPASQESQGSAASPSEPKVSESPSPVTTNTP 
AQFASVSPTVPEFLKTPPTAD 


750 


2100 


A 


6004 


2 


427 


LLTQAMLVLPHRPQWFTPGPRLQAQGPCQEG 
WRWELRLRNYVPEDEDLNKRRVPQAKPDAV 
QEKVKEQLEAAKPEPVIEEVmAKXAPRKPD 
WDLKRDVAI<KLEI<XLKRTQRA1 AELIRERLK 
GQEDSLDSAVDAATEHKTC 


751 


2101 


A 


6007 


33 


1280 


TDQAKVDNQPEKLVRSAEDVSTVPTQPDNPF 

SI IPDKLKRMSKS VP AFLQDESDDRETDTASE 

SSYQLSRHKKSPSSLTNLSSSSGMTSLSSVSGS 

VMSVYSGDFGNLEVKGNIQFA1EYVESLKEL 

HVFVAQCKDLAAADVKKQRSDPYVKAYLLP 

DKGl<^GKKKTLVvKKTLNP V Y N blLK Y KIbK 

QILKTQKXNLSIWHRDTFKRNSFLGEVELDLE 

T^WDNKQNKQLRWYPLKRKTAPVALEAE 

NRGEMKLALQYVPEPVPGKKLPTTGEVHrWV 

KECLDLPLLR GSHLNSFVKCTILPDTSRKSRQ 

KTRAVGKTTNPIFNHTMVYDGFRPEDLMEAC 

VELTVWDHYKLTNQFLGGLRIGFGTGKSYGT 

E\T>WMDSTSEEVALWEKMVNSPNTWIEATL 

PLRMLLIAKISK 


752 


2102 


A 


6028 


108 


1283 


KEIFSPFELISVKPLCLLLGVTCSQSMAFEELL 

SQVGGLGRFQMLHLVFILPSLMLLIPH1LLENF 

AAAIPGHRCWVHA^LJNNTGSGNETGILSEDA 
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>JSDOCID- <WO 0157188A2_I_> 



WO 01/57188 




T/US01/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid. E=Glutamic Acid, 
F=Phenylalanine, G=Glycine.. H=Histidine, 
1=1 so leucine, K=Lysine, L=Leucine, 
M=Methionine ; N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine ; V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown. *=Stop codon s 
/=poi>sibIe nucleotide deletion, \=possible 
nucleotide insertion 














LLRISIPLDSNLRPEKCRRFVHPQWQLLHLNG 

TlHSTSEADIbrCVDGWVYDQbYrPM IV I KW 

DLVCDYQSLKSWQFLLLTGMLVGGJIGGIIV 

SDRFGRRFILRWGLLQLAITDTCAAFAPTFPV 

YCVl.RFLAGFSSMIIISNNSLPITEWIRPNSKAL 

WILSSGALNIGQIILGGLAYWRDWQTLHVV 

ASVPFFVFFLLSRWLVESARWLIITNKJLDEGL 

KALRKVARTNGIKNAEETLNIEVVRSTMQEE 

LDAAQTKTTVWDLFRNPSMRKRJCILVFLRK 

KNLKEICA 


753 


2103 


A 


6043 


■ — | 


1470 


DSFESILRLIFE1HHSGEKGDIVVFLACEQDIEK 

VCETVYQGSNLNPDLGELVVVPLYPKEKCSL 

FKPLDETEKRCQVYQRRWLTTSSGEFLIWSN 

SVRFVIDVGVERRKVYNPRIRANSLVMQPISQ 

SQAEIRKQILGSSSSGKFFCLYTEEFASKDMTP 

LKPAEMQEANLTSMVLFMKRIDIAGLGHCDF 

MNRPAPESLMQALEDLDYLAALDNDGNLSE 

FGIIMSEFPLDPQLSKSILASCEFDCVDEVLTIA 

AMVTAPNCFSHVPHGAEEAALTCWKTFLHPE 

GDHFTLISIYKAYQDTTLNSSSEYCVEKWCRD 

YFLNCSALRMADVIRAELLEIIKRIELPYAEPA 

FGSKENTLNIKKALLSGYFMQlARD\ r DGSGN 

YLMLTHKQVAQLHPLSGYSITKKMPEWVLF 

HKTSISENNYlRJTSEISPELFMQbVPQYYFSNb 

PPSESKDILQQVVDHLSPVSTMNKEQQMCET 

CPETEQRCTLQ 


754 


2104 


A 


6055 


2 


394 


YYALHHWPFPDLLCQTTGAIFQMNMYGSCIF 

LMLTNVDRYAAIVHPLRLRHLRRPRVARLLC 

LGVWALILVFAVPAARVHRPSRCRYRDLEVR 

LCFESFSDELWKGRLLPLVLLAEALGFLLPLA 

AVVYSS 


755 


2105 


A 


6059 


3 


1795 


LGLGSGTLLSVSEYKKKYREHVLQLHARVKE 

RNARSVKITKRFTKLLIAPESAAPEEALGPAEE 

PEPGRARRSDTHTFNRLFRRDEEGRRPLTVVL 

QGPAGIGKTMAAKKILYDWAAGKLYQGQVD 

FAFFMPCGELLERPGTRSLADLILDQCPDRGA 

PVPQMLAQPQRLLFILDGADELPALGGPEAAP 

CTDPFEAASGARVLGGLLSKALLPTALLLVTT 

RAAAPGRLQGRLCSPQCAEVRGFSDKDKKK 

YFYKFFRDERRAERAYRFVKJENETLFALCFV 

PFVCWIVCTVLRQQLELGRDLSRTSK'lT'rSVY 

LLFITS VLS S APV ADGPRLQGDLRNLCRL ARE 

GVbGRRAQr Ab-KbbEQLbLKubKVQ 1 Lr LbK 

KELPGVLETEVTYQFIDQSFQEFLAALSYLLE 

DGG VPRT AAG G VGTLLRGD AQPH SHL VLTT 

RFLFGLLSAERMRDIERHFGCMVSERVKQEA 

LRWVQGQGQGCPGVAPEVTEGAKGLEDTEE 

r EbbbbUbbrlN Y Fbfcbb Y CL Y b 1 l^bUAr V KQ A 

LCRFPELALQRVRFCRMDVAVXSYCVRCCPA 

GQ ALRLI SCRL V AAQEKKKKSLGKRLQ ASLG 


756 


2106 


A 


6060 


12 


436 


SGRPTRPAKPU'GQGMGRFMLTLVCQGSIMMS 
ARDLIMNNLTELQPGLFHHLRFLEELRLSGNH 
LSIIIPGQAFSGLYSLICILMLHNNQLGGIPAQA 
LWELPSLQSLRLDANUSLVPERSFEGLSSLRH 
LWLDDNALTEIPS 


757 


2107 


A 


6063 


54 


419 


1TPLGLGAADMCAFPWLLLLLLLQEGSQRRL 
WRWCGSEEVVAVLQESISLPLEIPPDEEVENII 
WSSHKSLAT\ r VPGKEGHPATIMVTNPHYQG 
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3 DOC ID; <WO 01571B8A2_I_> 



WO 01/57188 A A PCT/US01/03800 

^^^^ - «"•«' 'li.rl' *^V< llwl 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end J 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F = Pnenylalanine. O— Ulycinc, \-\— Misname, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
i — i nreonine, v— vanne, w — lrypujpuaii, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














QILTMLLRSLQQPSASWPRDCSSSCSW 


758 


2108 


A 


6066 


125 


438 


IGISCPATIFVPMFSHSLIGIGEEYQLPYYNMV 
PSDPSYEDMREVVCVKRLRPIVSNRWNSDEC 

t Ti x T7T t^t v/CEfu/ A LXMD A CDT TAT T) TV V TT A V 

LRA VLKLMbr.t W AHN r AoKL 1 ALrUKKl J-./YJv 
MVESQDVK1 


759 


2109 


A 


6072 


3 


650 


PGRRFRP AALEERAMEKLREK VPFQN RGKGT 

LSSIIPNNSDTRKATETTSLSSKPEYVNPDFRW 

SKDPSSKSGNLLETSEVGWTSNPEELDPIRLA 

LLGKSGLSCQVGSATSHPVSCQEP1DEDQRISP 

KDKSTAGREFSGQVSHQTTSENQCTPIPSSTV 

HSSVADMQNMPAAVHALLTQPSLSAAPFAQ 

RYLGTLPSTGS1 lLFQCHAufNAi v w 


760 


2110 


A 


6077 


3 


730 


PLRLTLMEEVLLLGLKDREGYTSFWNDCISSG 

LRGCMLIELPLRGRLQLEACGMRRKSLLTRK 

VICKSDAPTGDVLLDEALKHVKETQPPETVQ 

NWIELLSGETWNPLKLHYQLRNVRERLAKNL 

VEKGVLTTEKQNFLLFDMT1 HPJL 1 InNNIKV^K 

LIKKVQEAVLDKWVNDPHRMDRRLLALIYL 

AHASDVLENAFAPLLDEQYDLATKRVRQLLD 

LDPEVECLKANTNEVLWAVVAAFTK 


761 


2111 


A 


6078 


833 


390 


IVSFHLSGFKKFVRPFSFLSVHGLQVDEYHSV 
HQ KLS ADM ADHSNLIRSLL VG AED ARLMRD 
MKTMKSRYMELYDLNRDLLNGYKIRWNNH 
TELLGNLKAVNQAIQRAGRLRVGKPKNQVIT 
ACRDAIRSNNINTLFKIMRVGTASS 


762 


2112 


A 


6079 


2 


2686 


KKAITCGEKEKQDLIKSLAMLKDGFRTDRGS 

HSDLWSSSSSLESSSFPLPKQYLDVSSQTDISG 

SFGT1MSNNQLAEKVRLRLRYEEAKRRIANLKI 

QLAKLDSEAWPGVLDSERDRL1LINEKEELLK 

EMRFISPRKWTQGEVEQLEMARKRLEKDLQ 

AARDTQSKALTERLKLNSKRNQLVRELEEAT 

RQVATLHSQLKSLSSSMQSLSSGSSPGSLTSSR 

GSLVASSLDSSTSASFTDLYYDPFEQLDSELQ 

SKVEFLLLEGATGFRPSGCITTIHEDEVAKTQ 

KAEGGGRLQALRSLSGTPK.SMTSLSPRSSLSS 

PSPPCSPLMADPLLAGD AFLNSLEFEDPEL S A 

TLCELSLGNSAQERYRLEEPGTEGKQLGQAV 

NTAQGCGLKVACVSAAVSDESVAGDSGVYE 

ASVQRLGASEAAAFDSDESEAVGATRIQIALK 

YDEKNKQFAILI1QLSNLSALLQQQDQKVNIR 

VAVLPCSESTTCLFRTRPLDASDTLVFNEVFW 

VSM S YPALHQKTLRVDV CTTDRSHLEECLGG 

AQISLAEVCRSGERSTRWYNLLSYKYLKKQS 

RELKPVGVMAPASGPASTDAVSALLEQTAVE 

LEKRQEGRS STQTLEDS WRYEETSENEA VAE 

EEEEEVEEEEGEED VFTEl^SPDiVlJJu Y rAJLrv 

VDKETNTETPAPSPTVVRPKDRRVGTPSQGPF 

LRGSTIIRSKTFSPGPQSQYVCRLNRSDSDSST 

LSKKPPFVRNSLERRSVRMKHPSPPPQPSSVK 

SLRSERLIRTSLDLELDLQATRTWHSQLTQEIS 

VLKELKEQLEQAKSHGEKFLPQWLREDERFR 

LLLRMLEKRMDRAEFrMGELQTDKMMRAAA 

KDVHRLRGQSCKEPPEVQSFREKMAFFTRPR 

MN1PALSADDV 


763 


2113 


A 


6082 


3 


1558 


PHPIRFSKLCVSFNNQEYNQFCVIEEASKANE 

VLENLTQGKMCLVPGKTRKLLFKFVAKTED 

VGKKIEITSVDLALGNETGRCWLNWQGGGG 

DAASSQEALQAARSFKRRPKLPDNEVHWGSII 

IQASTMIISRVPNISVHLLHLPPALTNEMYCLV 
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>JSDOCID- <WO 0157188A2J_> 



WO 01/57188 




T/US01/O380O 



•b„l< J^Jl JLj> ji .Q^; 



SEQ ID 

XTO- /vf 
INW. OI 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

lNW. OI 

peptide 
seq- 
uence 


Met 
nuu 


SEQ 

IU I s * \J. 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cy stein e, 
D=Aspartic Acid. E = GIutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidinc, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline s 
Q=G lutamine, K=Argtnine, S=Serine, 
T=Threonine ; V=Valine, W=Tryptophan. 
Y=Tyrosinc, X=Unknown, * s =Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














VTVQSHEICTQIRDVKXIAGLKLPGQDANLTQK 

THVTLHGTELCDESYPALLTDJPVGDLHPGEQ 

LEKMLYVRCGTVGSI^v^FLVYVSYLiNTTVEE 

ivel VOrvCJiisJJt. 1 v J lr* J V r rr U V A V K r V a I Kr 

EHLERVYADIPFLLMTDLLSASPWALTIVSSE 

LHLAPSMTTVDQLESQVDNVILQTGESASECF 

CLQCPSLGNIEGGVATGHYIISWKRTSAMENI 

PIITTVITLPHVIVENIPLHVNADLPSFGRVRES 

LPVKYIILQNKTDLVQDVEISVEPSDAFMFSG 

LKQ1RLRILPGTEQEMLYNFYPLMAGYQQLPS 

LN1NLLRFPNFTNQLLRRFIPTSIFVKPQGRI.M 

DDTSIAAA 


764 


2114 


A 


6093 


1 


1422 


AAADLANSNAGAAVGRKAGPRSPPSAPAPAP 

PPPAPAPPTLGNNHQESPGWRCCRPTLRERN 

ALMFNNELMADVHFVVGPPGATRTVPAHKY 

VLAVGSSVFYAMFYGDLAEVKSEIHIPDVEPA 

AFULLKYMYSDEIDLEADTVLATLYAAKKYI 

VPAL AKAC VN FLETSLE AK.N AC VL LSQ S RLF 

EEPELTQRCWEVIDAQAEMALRSEGFCEIDR' 

QTLEIIVTRE.ALNTKEAVvFEAVLNWAEAEC 

KRQGLPITPRNKRHVLGRALYLVRIPTMTLEE 

FANGAAQSDILTLEETH SIFLW YTATNKPRLD 

FPLTKRKGLAPQRCHRFQSSAYRSNQWRYRG 

RCDSIQFAVDRRVFIAGLGLYGSSSGICAEYSV 

KJELKRLG V V L AQNLTKFM S DG S SNTFP V WF 

EHPVQVEQDTFYTASAVLDGSELSYFGQEGM 

TEVQCGKVAFQFQCSSDSTNGTGVQGGQIPE 

LIFYA 


765 


2115 


A 


6099 


1 


1150 


SGFTHYAJYDFIVKGSCFCNVHADQCIPVHGF 

RP VKAPGTFHIV1 VH GK CMCKHN TAGS HCQH 

CAPLYNDRPWEAADGKTGAPNECRTCKCNG 

HADTCHFDVNVWEASGNRSGGVCDDCQHN 

TEuQYCQRCKPGFYRDLRRPFSAPDACfCPCS 

CHPVGSAVLPANSVTFCDPSNGDCPCKPGVA 

GRRCDRCMVGYWGFGDYGCRPCDCAGSCD 

PITGDCISSHTDIDWYHEVPDFRPVHNKSEPP 

WEWEDAQGFSALLHSGKCECKEQTLGNAKA 

rtuMKror VbKJKlLbAHUKO 1 hi vb, VN VKJK 

KVLKSTKLKIFRGKRTLYPESWTDRGCTCPIL 

NPGLEYLVAGHEDIRTGKLIVNMKSFVQFIWK 

PSLGRKVMDILKRECK 


766 


2116 


A 


6103 


2 


384 


MTAAATATVLKEGVLEKRSGGLLQEWKRICR 
CVLTERGLQLFEAKGTGGRPKELSFARIKAVE 

^ VJbo 1 uKrll lrl L.V 1 J^OOObIL)rKCrL,biJrO W 

NAQITLGLVKFKNQQAIQTVRARQSLGTGTL 
VS 


767 


21 17 


A 


O 1 uo 


1 


DhZ 


MARYY1IKYADQKALYTRDGQLLVGDPVAD 
NCCAEKJCTLPNRGLDRTKVPIFLGIQGGSRC 
LACVETEEGPSLQLEDVNIEELYKGGEEATRF 
TFFOSSSGSAFRLEAAAWPGWFLCGPAEPDD 
PVQLTKESEPSAimCFYFEQSW 


768 


2118 


A 


6109 


3 


292 


FILQAVLQLSSQEARYKAFGTCVSH1GAILAF 
YTPSVISS VMI IRVARCAAPHVHILLANFYLLF 
PPMVNPIIYGVKTKQIRDSLGSIPEKGCVNRE 


769 


2119 


A 


6110 


1 


711 


RHEPSCSNGVASTKSKQNHSKYPAPSSSSSSS 
SSSSSSSPSSVNYSESNSTDSTKSQHHSSTSNQ 
ETSDSEMEMEAEHYPNGVLGSMSTRIVNGAY 
KHEDLQTDESSMDDRHPRRQLCGGNQAATE 
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5DOCID: <WO 0157188A2J. > 



PCT/U SO 1/03800 

j,,,,,. >iaIl . ij^ji 4;„,jt 



SEQ ID 
NO: of ' 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A== Alanine C=Cysteine> 
D=Aspartic Acid. E=Glutamic Acid, 
FHPhenylalanine, GOlycine, r^Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline ; 
Q=Glutamine, R=Arginine, S=Serine, 
T^hreonine, V^Valine, \V=Tryptophan, 
Y=Tyrosine, X=Unkno\vn, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














RIILFGRELQALSEQLGREYGKNLAHTEMLQD 
. r pi t a vcr\Dn/QrD\/nnr^i fjptorfpVPA A I 
NSAILESQNLPKQPPLMLALGQASECLRLMA 
RAGLG SCSFARVDDYLH 


770 


2120 


A 


6125 


2 


570 


"YFGLNLHVQHLGNNVFLLQTI ,FG AVILL ANC 
V AP W ALKYMNRRASQMLLMFLL A1CLL AlIF 
VPQEMQMLREVLATLGLGASALANTLAFAH 

/TkmiiTnTT-m A T> A K/f/^lTXT A TCA"NJT A A I APT MM 
GNEV1P 1 IIRAKAMOIJN/Yl r AiN i.J\KJ/-\L J n.tr i_ ivi lvi 

ILSVYSPPLPWIIYGVFPFISGFAFLLLPETRNK 
PLFDT1 QDEKNERKDPREPKQEDPR VE VTQF 


771 


2121 


A 


6126 


909 


353 


RSFVLDTASAICNYNAHYKNHPKYWCRGYF 

RDYCNIIAFSPNSTNHVALRDTGNQL1VTMSC 

LTKEDTGWYWCGIQRDFARDDMDFTELIVT 

DDKGTL ANDF W SGKDLSGNKTRSCKAPKW 

RJKADRSRTSILIICILITGLGIISVISHLTKRRRS 

ORNRRYGNTLKPFSRVLTPKEMAPTEQM 


772 


2122 


A 


6148 


7 


810 


FVLGILALSHTIi5FFMNlsJ < Jt , i'Aii-rNK<^ Y <^Li-r 

TQGSGENKEEIINYEFDTKDLVCLGLSSIVGV 

WYLLRKHWIANNLFGLAFSLNGVELLHLNN 

VSTGCILLGGLFIYDVFWVFGTNVMVTVAKS 

FEAPIKLVFPQDLLEKGLEANNFAMLGLGDV 

VTPGTFI ALI JjRFDI SLKKNTHTYFYTSF AAYIF 

GLGLTIFIMHIFKHAQPALLYLVPACIGFPVLV 

ALAKGEVTEMFSYEESNPKDPAAVTESKEGT 

EASASKGLEKKEK 


773 


2123 


A 


6161 


3 


1088 


CQPML VTRKNHPKLLLRRTES V AEKMLTN W 
FTFLLYKFLKESAGEPLFMLYCA1KHQMEKG 
PIDAITGEARYSLSEDKLIRHLIDYKTLTLNCV 
NPENENAPEW VKGLDCD 1 u 1 QAivti^LUA 
AYKGVPYSQRPKAADMDLEWRQGRMARI1L 
QDEDVTTKIDNDWKRLNTLAHYQVTDGSSV 
ALVPKQTS AYN1SNS STFTKSLSRYESMLRT A 
S SPD SLRSRTPMITPDLESGTKL WHL VKNHDH 
LDQREGDRGSKN4VSEIYLTRLLATKGTLQKF 
VDDLFE11FSTAHRGSALPLA1KYMFDFLDEQ 
ADKHQIHDADVRHTWKSNCLPLRFWVNV1K 
NPQFVFDIHKJnM i jjaulo v v 


774 


2124 


A 


6163 


860 


125 


KTAVKKRNLNPVFNETLRYSVPQAELQGRVL 
SLSVWHRESLGRNIFLGEVEVPLDTWDWGSE 
PT WLPLQPRVPPSPDDLPSRGLLALSLK Y VPA 
GSEGAGLPPSGELHFWVKEARDLLPLRAGSL 

DTi VQCr VLrl^lyoKAoKv^rvi lv v \ ivAOL-or v r 

NI1TMVYDGFGPADLRQACAELSLWDHGALA 
NRQLGGTRLSLGTGSSYGLQVPWMDSTPEEK 
OLWQALLEQPCEWVDGLLPLRTOLAPRT 


775 


2125 


A 


6191 


2 


392 


ARGIGSLGRDHSGSGGGTGMAGAWVRKAAD 
YVRSKDFRDYLMSTHFWGPVANWGLPIAAIT 
DMK\KSPEIISRRMTFAL*CYSLTFVRFAHYVQ 
\PWNAVLMLGCHTAVDFDQLISSMPCISHGMT 
ASASAL 


776 


2126 


A 


6217 


i 
1 




FRGYWGVREAFTDASWSGGLGPGKPGMKIT 

RQKHAKKHLGFFRNNFGVREPYQILLDGTFC 

QAALRGRIQLREQLPRYLMGETQLCTTRCVL 

KELETLGKDLYGAKLIAQKCQVRNCPHFKNA 

VSGSECLLSMVEEGNPHHYFVATQDQNLSVK 

VKKKPGVPLMFIIQNTMVLDKPSPKTTAFVKA 

VESGXRLSQCMRKKVSNISKRNRV* *KTLNRG 

RRKKJ^KKJSGPNPLSCLKKKXKAPDTQSSASE 

KKRKRKRIRNRSNPKVLSEKQNAEGE 
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SEQ ID 
NU: ol 
nucl- 
eotide 
seq- 
uence 


SEQ ID 

rf*J. 01 

peptide 
seq- 
uence 


Met 
nod 


SEQ 
ID NU: 
in 

USSN 
09/496 

Q1 A 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine. 
D=Aspartic Acid, E=Glutamic Acid. 
F-^Phenylalanine 3 G=Glycine : H— Hisiidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Giutamine. R— Arginine, S— Serine. 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X^nknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


111 


2127 


A 


6236 


1038 


1402 


YYQISSLPSIVGNG1KLWLL1C1FLAKQGGSRL* 
r UivrKUvjunL Kau V Luy ru\^rlU 1 1 rVbrr 

YNSK1SPALWGPPVIPSALGGEAGKSL*PRRQ 
RFQRGG1APLPSRVRGRAKLFLI<JCK 


IIS 


2128 


A 


6237 


422 


913 


ASFFHHHRGAFLLLLAIPGS*GQDQSLIHWSN 

AVSNADYLLDLK\N*LDFALEEKMPL\EVKVVP 

ryl VL\bbrN*i<.bOOCr bAPbrbvrPWTGEVKr/ 

SPQRDGGALG\QGPLGIPSDSILALLKKQT*RA 

LLNWPLGSLRRSSCFGGQDGQDLKPRSGLGC 

NSFRYRR 


779 


2129 


A 


6249 


420 


36 


ARAPSPSFSVRDVELSDPARERGEMPVAVGP 

YGQSQPSCFDRVKMGFVMGCAVGMAAGAL 

FGTFSCLSSILVSSSG/SGMRGRELMGG1GKTM 

MQSGGTFGTFMAIGMGIRC*PWLPTTSVPSH 

QSQPMY 


780 


2130 


A 


6263 


415 


1380 


RJMRMCDRGIQMLITTVGAFAAFSLMTIAVG 

TDYWLYSRGVCRTKSTSDNETSRKNEEVMT 

HSGLWRTCCLEGAFRGVCKKIDHFPEDADYE 

QDTAEYLLRAVRASSVFPILSVTLLFFGGLCV 

AASEFHRSRHNVILSAGIFFVSAGLSNIIGIIVYI 

S\ANAGRTPGQR\DSICKSYSYGWSFATSGAFS 

FIIGR/IIC*GVGLPWHIYIEKHQQLRAKSHSEF 

LKKSTFARLPPYRYRFRRRSSSRSTEPRSRDLS 

PISKGFHTD^STDISMFTLSRDPSKITMGTLLNS 

DRDHAFLQFHNSTPKEFKESLHNNPANRRTT 

PV 


781 


2131 


A 


6274 


832 


318 


RJIKV1CDLKQTLAHCTAYPRCKCLVEMDQTFH 
LQVKQKQLACLCTWQARDPDCPPSTKVVL/L 
VGPGMGCMVALFQDSIAWSNKSMPSSLSA1S 
QSPCQVQAPEGPSSFHLPTLSFTTCLSWQGGD 
LEFLGDLKGCSELKNFQELITQSALVHPKADV 
WWYCGRPLLGTLPSN 


782 


2132 


A 


6281 


1324 


393 


WISLPSSLLCRKNGSSAEDDRRXGEPSAEEAEG 

EREDWG1GSA*SVGAVSKVPSARF*RTYPS\E 

DEEEVTHQKSSSSDSNSEEHRKKKTSRSRNK 

KKRKNKSSKRKHRKYSDSDSNSESDTNSDSD 

DDKJCRN^CAKKICKKKKiCHKT 

ESSDSSCKJDSEEDLSEATWMEQPNVADTMDL 

IGPEAPIJHTSQDEKPLKYGHALLPGEGAAMA 

E Y VKAGKKJPRRGE1GLTSEEIG SFECS G Y VM 

SGSRHRRJvlEAVRLRKENQIYSADEKRALASF 

NQEERRKRESKILASFREMVHKKTKGKDDK 


/ Oj 


91 11 


A 

A 




on i 
ZU 1 




\V uu Y r QOALKKKb AAtOLHr JLGr PGRVRGQ 
I/RGTTGPAWYCHSPSHSLLSAFCHLPTPSRCP 
AMARPPVPGS VVVPN WHES/RRGQG VPGLHS 
AQEPP AG V W AA* AAS AAAAVLS1DTAS YKIF V 
SGKSGVGKTALVAKLAGLEVPVVHHETTGIQ 
TTVVFWPAKLQASSRVVMFRFEFWDCGESA 
LKKFDHMLL ACMENTDAFLFLFSFTDRA S FE 
DLPGQLARJAGEAPGWRMVIGSKPDQYMHT 
DVPFRD1 TAKROAWF1 PI I RVkT^VPHRRI Ci 


784 


2134 


A 


6308 


86 


96 


GSSPDPASUTMKNQDKKNGAAKQSNPKSSP 
GQPEAGPEG AQERPS QAAPA VEAEGPGSSQA 
PRKPEGAQARTAQSGALRDVSEELSRQLEDIL 
STYCVDNNQGGPGEDGAQGEPAEPEDAEKSR 
TYVARNGEPEPTPVVNGEKEPSKGDPNTEEIR 
QSDEVGDRX>HRRPQEKJOCAKGLGlCEITLLM 
QTLNTL STPEEKL AALCKKY AELLEEHRNSQ 
KQMKLLQKKQSQLVQEKDPiLRGEHSKAVLA 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue | 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid. E=Glutamic Acid, 
F=Phenytalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=<31utamine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, W=Tryptophan. 
Y=Tyrosinc, X=Unknovvn, *=Stop codon, 
/=possible nucleotide deletion, ^possible 
nucleotide insertion 














RS fCLE SLCREL Q RH N K5> LKfcJbU v ^ K/uvE,£,isr. 

KRKEVTSHFQYTLNDIQLQMEQHNERNSKLR 

QENMELAERLKKXIEQYELREEHIDKVFKHK 

DLQQQLVDAKLQQAOEMLKEAEERHQREKD 

FLLKEAVESQRMCELMKQQETHLKQQLALY 

TRKFEEFQNTLSK S SE VFTTFKQEMEKMTKKJ 

KKLEKETTMYRSRWESSNKALLEMAEEKTV 

RDKELEGLQ V KlQKLiiKJ-oKAL» l / ua^ rvR 

GORWGSHRTSAVRIFS 


785 


2135 


A 


6319 


1493 


889 


SPQGPLLRSVSPVSAGASSVTPGGAQPGVTTT 

PPSI ,V A VAPAPGS AAGPA AGWQ*HAGCR/WT 

KXPWSWGMRPMKIFFSEEYRS1STRISHDAL* 

EKCTQP AKPLSMIRX 1 Gbb V kj/FLi v jv w in vv i 

RREFRNSGTRVVSSCCGMSCMYSFLGHCSV/S 

QDLPLVHVDVGWQPPLGPTVGLRPGLLPLHD 

TTPCQKLVVDDLDWA 


786 


2136 


A 


6320 


551 


135 


RWLPVAECDSSCVGCTGEGPGNCKECISGYA 
REHGQCADVDECSLAEKTCVRKNENCYNTP 
GS YVC VCPDGFEET/RRCLC AAGRG* SHRRRK 
PDTAALPRRP VMCRT YPLNYSEG CP VENVAL 
RMPSPAVDSGGERLPAL 


787 


2137 


A 


6330 


1693 


227 


DYVLTAELHRQRSPGVSFGLSVFNLMNAIMG 

SGILGL A YVMANTG VFGFSFLLLT VALLAS Y S 

VHLLLSMCIQTAYLGP*TNYFMVLPAH*LTCL 

PLIEFLQSL*NSL\*AVTSY£DLGLFAFGLPGKL 

VVAGTIIIQNIGAMSSYLLIIKTELPAAIAEFLT 

GD Y SRYWYLDGQTLLIHC VGI VFPL ALLPKIG 

FLG^ TSSLSFFrMMFFALVvlllsjwwaii i^rL. 1 1^ 

NY\ A EKGFQISNVTDDCICPKJLFHFSKESAYALP 

TMAFSFLCHTSILPIYCELQSPSKKRMQNVTN 

TA1ALSFL1YF1SALFGYLTFYD/GTTKAQRGE 

VTCHRIKDKVESELLKG* **IP* SHDWVMTW 

KLCILFAVLUTVPLIHFPARKAVTMMFFSNFP 

FSWIRHFLITLALNIIIVLLAIYVPDIRNVFGVV 

GASTSTCLIFIFPGLFYLKLSREDFLSWKKLGV 

GCFC/LLSFKTSILRNSLSVY1ILPASRKSIYFKI 


788 


2138 


A 


6351 


1 


6622 


PRSLCFSLWAEAAVLADGGLRRRRRLLRGTM 1 

SASFVPNGASLEDCHCNLFCLADLTGIKWKK 

YVWQGPTSAPILFPVTEEDPILSSFSRCLKADV 

LG/VWRRDQRPERREXL* IFWGGEDPWLLTLF 

TMTYQKKKMECGRMDFPMNAVLCFSKAVH 

NLLERCLMNRNFVRIGKWFVKPYEKDEKPrN 

KSEHLSCSFTFFLHGDSNVCTSVEINQHQPVY 

LLSEEHITLAQQSNSPFQVILCPFGLNGTLTGQ 

AFKMSDSATKKLIGEWKQFYPISCCLKEMSE 

EKQEDMDWEDDSLAAVEVLVAGVRMIYPAC 

FVLVPQSDIPTPSPVGSTHCSSSCLGVHQVPAS 

TRDPAMSSVTLTPPTSPEEVQTVDPQSVQKW 

VKFSSVSDGFNSDSTSHHGGKIPRKLANHVV 

DRVWQECNMNRAQNKRKYSASSGGLCEEAT 

AAKVASWDFVEATQRTNCSCLRHKNLKSRN 

AGQQGQAPSLGQQQQILPKHKTNEKQEKSEK 

PQKRPLTPFHHR VS V SDD VGMD\ADS\ASQRL 

V\ISAP\DSQ\VRFSNIR\TNDVAK\TPQMHGTE 

MANSPQPPPLSPM-1PCDWDEGVTKTPSTPQS 

QHFYQNrPTPDPLVPSKPMEDRIDSLSQSFPPQ 

YQEAVCTTVYVGTAVNLEEDEANIAWKYYK 

FPKKKDVEFLPPQLPSDKFKDDPVGPFGQESV 

TSVTELMVQCKKPLKVSDELVQQYQIKNQCL 

SA1ASDAEQEPKIDPYAFVEGDEEFLFPDKKD 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
nod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


.Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenyl alanine, G^Glycine, H=Histidine, 
I=Isoleuuine, K^Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, W-Tryptophan, 
Y=Tyrosinc, X=Unknown, *=Stop codon, 
/possible nucleotide deletion, \=possible 
nucleotide insertion 














RQNSEREAGKKHKVEDGTSSVTVLSHEEDA 

MSLFSPSIKQDAPRPTSHARPPSTSUYDSDLA 

VSYTDLDNLFNSDEDELTPGSKRSANGSDDK 

ASCKESKTGNLDPLSC1STADLHKMYPTPPSL 

EQHIMGFSPMNMNNKEYGSMDTTPGGTVLE 

GNSSSIGAQFKIEVDEGFCSPKPSEIKDFSYVY 

KPENCQILVGCSMFAPLKTLPSQYLPLIKLPEE 

CIYRQSWTVGKLELLSSGPSMPFIKEGDGSNM 

DQEYGTAYTPQTHTSCGMPPSSAPPSNSGAG1 

LPSPSTPRFPTPRTPRTPRTPRGAGGPASAQGS 

VKYENSDLYSPASTPSTCRPLNSVEPATVPSIP 

EAHSLYVNLILSESVMNLF1CDCNSDSCCICVC 

NMNIKGADVGVYIPDPTQEAQYRCTCGFSAV 

MNRKFGNNSGLFFEDELD11GRNTDCG1CEAE 

KRFEALRATSAEHVNGGLKESEKLSDDLILLL 

QDQCTNLFSPFG AADQDPFPKSG VI SN W VR V 

EERDCCNDCYLALEHGRQFMDNMSGGKVDE 

ALVKSSCLHPWSKRNDVSMQCSQDILRMLLS 

LQPVLQDAIQK1CRTVRPWGVQGPLTWQQFH 

KMAGRGSYGTDESPEPLPIPTFLLGYDYDYLV 

LSPFALPYWERLMLEPYGSQRDIAYVVLCPE 

NEALLNGAKSFFRDLTAIYESCRLGQHRPVSR 

LLTDGIMRVGSTASKKLSEKLVAEWFSQAAD 

GNNEAFSKLKLYAQVCRYDLGPYLASLPLDS 

SLLSQPNLVAPTSQSLITPPQMTNTGNANTPS 

ATLASAASSTN4TVTSGVAISTSVATANSTLTT 

ASTSSSSSSNLNSGVSSNKLPSFPPFGSMNSNA 

AGSMSTQANTVQSGQLGGQQTSALQTAGISG 

ESSSLPTQPHPDVSESTMDRDKVGIPTDGDSH 

AVTYPPAIVVYIIDPFTYENTDESTNSSSVWTL 

GLLRCFLEMVQTLPPHIKSTVSVQIIPCQYLLQ 

PVKHEDRErYPQHLKSLAFSAFTQCRRPLPTS 

TNVKTLTGFGPGLAMETALRSPDRPECIRLYA 

PPFILAPVKX>KQTELGETFGEAGQKYNVLFV 

GYCLSHDQRWILASCTDLYGELLETCIINIDVP 

NRARRKKSSARKFGLQKLWEWCLGLVQMSS 

LPWRVVIGRLGRJGHGELKDWSCLLSRRNLQ 

SLSKRLKDMCRMCGISAADSPSILSACLVAM 

EPQG SF VIMPDS VSTGS VFGRSTTLNMQTSQL 

NTPQDTSCTI-IILVFPTSASVQVASATYTTENL 

DLAFNPNNDGADGMGIFDLLDTGDDLDPD1I 

NILPASPTGSPVHSPGSHYPHGGDAGKGQSTD 

RLLSTEPHEEVPNILQQPLALGYFVSTAKAGP 

LPDWFWSACPQAQYQCPLFLKASLHLHVPSV 

QSDELLHSKHSHPLDSNQTSDVLRFVLEQYN s 

ALSWLTCDPATQDRJISCLPIHFVVLNQLYNFI 

MNML 


789 


2139 


A 


6359 


1 


2002 


TGTLTEDGLDVMGVVPLKGQAFLPLVPEPRR 

LPVGPLLRALATCHALSRLQDTPVGDPMDLK 

MVESTGWVLEEEPAADSAFGTQVLAVMRPP 

LWEPQLQ AMEEPP VP VS VLHRFPFS S ALQRM 

SVV VA WrvjAlQPfcAY VKGbrbLVAGLCNPET 

VPTDFAQMLQSYTAAGYRVVALASKPLPSVP 

SLEAAQQLTRDTVEGDLSLLGLLVMRNLLKP 

QTTPVIQALRRTRIRAVMVTGDNLQTAVTVA 

RGCGMVAPQEHLIIVHATHPERGQPASLEFLP 

MESPTAVNGVKDPDQAASYTVEPDPRSRPILA 

LSGPTFGI1VKHFPKLLPKVLVQGTWARMAP 

EQKTELVCELQKLQYCVGMCGDGANDCGAL 

KAADVGISLSQAEASWSPFTSSN4ASIECVPM 
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SEQ ID 
NO: of i 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 1 

IDNO: 

tn 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to Jast amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine C=Cysteme, 
D=Aspartic Acid. E^Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine ; L^Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=01utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Trvptophan, 
Y=Tyrosine, X=Unkno\vn, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














VIREGRCSLD 1 Sh S V b K Y MAL Y l^rib v Lii. 

YTINTNLGDLQFLAIDLVITTTVAVLMSRTGP 

ALVLGRVRPPGALLSVPVLSSLLLQMVLVTG 

VQLGGYFLTLAQPWFVPLNRTVAAPDNLPNY 

ENTVVFSLSSFQYLTT.AAAVSKGAPFRXRPLTN 

NVPFLLASAL* SS VLWLVLSPGLLHGPLALR 

NITDTGFKLLLVGLVTLNFVGGLHAGERARP 

VPPRLPAPPPAQAG\SKKRFKQLERELAEQPW 

PPLPAGPLR 


790 


2140 


A 


6380 


76 


1059 


SSAGSARKLQVN4ALAARLWRLLPFRRGAAP 

GSRLPAGTSGSRGHCGPCRFRGFEVMGNPGT 

FKRGLLLSALSYLGFETYQVlbQAAV vha J a 

KVEEILEQADYLYESGETEKLYQLLTQYKESE 

DAELLWRLARASRDVAQLSRTSEEEKKLLVY 

E ALEYAKRA/L/EKNESSFASHKWYA1CLSDV 

GDYEGIKAKIANAY1IKEHFEKAIELNPFCDATS 

IHLMGIWCYTFAEMPWYQRRIA*NACLQLPP 

* FPP YEKALG\YFHR AEQVDPNFY SKNLLLLG 

KTYLKLHNKKLAAFWLMKAKDYPAHTEED 

KOIOTEAAQLLTSFSEKM 


791 


2141 


A 


6434 


3 


1460 


1ALLIVDGLAWDDQGGLALLHISPSKLIL*QDS 

SGMS/YVMVRCTITRAFFKSLLCHICQYSIGPQ 

* VTACPGQDACKE* KSTAN * GG* RE* * PQ VLFF 

AFLSNPAVKFGRMSKKQRDSLYAEVQKHQQ 

RLQEQRQQQSGEAEALARVYSSSISNGLSNLN 

NETSGTYANGSVIDLPKSEGYYNVVSGQPSP 

DQSGLDMT\GIKQIKQEPIYDLTSVPNLFTY\SS 

FNNXGQLAPGITWfTEIDRIAQNHKSHLfc J CQ Y 

TMEELHQLAWQTHTYEE1KAYQSKSREALW 

QQCAIQITHA1QYVVEFAKRITGFMELCQNDQ 

1LLLKSGCLEWLVRMCRAFNPLNNTVLFEG 

KYGGMQMFKALGSDDLVNEAFDFAKNLCSL 

QLTEEEIALFSSAVLISPDRAWLIEPRKVQKLQ 

EKIYF ALQH VIQKNHLDDETLAKLI AKIPTITA 

VCNLHGEKLQVFKQSHPEIVNTLFPPLYKELF 

NPDCATACK 


792 


2142 


A 


6440 


92 


781 


"SRGTFRCFCRDFFPCFSNMRLFLWNAVLTLl-v 
TSLIGALIPEPEVKIEVLQKPFICHRKTKGGDL 
MLVHYEGYLEKDGSLFHSTHKHNNGQPIWFT 
LGILEALKGWGPG A* K/DMCVGEKRKLIIPPA 
LGYGKEGKGKIPPESTLIFN1DLLE1RNGPRSH 
ESFQEMDLNDDWKLSKDEVKAYLKKEFEKH 
GAVVNESHHDALVEDIFDKEDEDKDGFISAR 
EFTYKHDEL 


793 


2143 


A 


6446 


3201 


152 


PRJLKRLVVTEEDGGARPEALGK1APRTPAELG 

ARADQELVTALMCDLRRPAAGGMMDLAYV 

CEWEKWSKSTHCPSVPLACAWSCRNLIAFTM 

DLRSDDQDLTRMIHILDTEHPWDLHSIPSEHH 

EAJTC\LEWDQSGFPGFLFSRWPTGQ1K\CWS 

MGVSTLA\NSWE\SSVGSL\VEGGPHLWALS\ 

WLHXNGVKLALHVEKSGASSFGEKFSRWKFS 

P\SLTLF\GGNAMEGW1AVTVSGLVTVSLLQ\P 

SGQVUTSTVESLCRLRARVALADIAFTGGGNI 

VVATADGSSAXSPVQFYICVCVSVVSEKCRIDT 

DILPSLFMRCTTDLNRKDKFPAITHLKFLARD 

MSEQVLLCASSQTSSIVECWSLRKEGLPVNNI 

FQQISPWGDKQPTILKWRILSATNDLDRVSA 

V\ALPKLPISLTNTDLKVASDTQFYPGLGLAL 

AFHDGSVHIVHRLSLQTMAVFYSSAAPRPVD 

EP AMKRPRTAGPA VHLKAMQLS WTSL AL VG 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 

NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
lj — /-vspanii^ /-vcia, n^vjiuiamic /\ciu, 
F=Phenylalaninc } GKjlycinc, H=Histidine, 
I=Isoleucine, K=Lysine, L-Leucine, 
M=Methionine, N=Asparagine, P=Proline r 
Q=Glutamine ? R-Arginine, S=Serine, 
T^Threonine, V=VaIine s W=Tryptophan. 
Y=Tyrosine, X=Unkno\vn 3 *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














IDSHGKLSVXLRLSPSMGHPLEVGLALRPILLFL" 
LEYCM VTG YD WWD1LLHVQPSM VQSLVEKX 
HEEYTRQTAALQQVLSTRJLAMKASLCKLSP 
CTVTRVCDYHTKLFLIA1SSTLKSLLRPIIFLNT 
PDK SPGDRJLTEICTKITD VDIDKVMINLKTEEF 
VLDMNTLQALQQLLQWVGDFVLYLLASLPN 
QPCPTSEPCPTSEPSPTSEPSPTSEPSSP*SLC\G 
SLLRPGHSFLRDGTSLGMLRELMVVIRIWGLL 
KroLLr v r i A.1 CiL) I ^JJoMoLLrKLL I ivL» WiCC 

RDEGPASEPDEALVDECCLLPSQLLIPSLDWL 
PA^nni v<sPT nPKTjpi pi r»prvRAPTT pp,qaat 

LQLDGLARAPGQPKIDHLRRLHLGACPTEEC 

KACTRCGCVTMLKSPNRTTAVKQWEQRW1K 

NC/LVRWALVAGAPQLPLSPAAPQLLLSYPSA 

APEPGCCKSHRSPWTLLGAVNLSPPCRAVEG 

RGPDACVTSRASEEAPAFVQLGPQSTHHSPRT 

PRSLDHLHPEDRP 


794 


2144 


A 


6490 


418 


585 


tvJnn»kT A P&T PXIP^ri? Ar»\/1 ~KAT>\t\TT>ai wit: a "cr^r^ 
in vji i ^ rs./-\ i^i^i21n r./>3^ixj L vv^ v JLtivir v v r/iL W ii/\.t,ljO 

GSIEPRDLRLQ*AVITPL\TPAWVTQ 


795 


2145 


A 


6499 


395 


1027 


KLLWLPPHSEQKRSPLYHPQGPSGTTPSAP\FS 

SHSPPPSLLQAXPSIAAFLRTHGHISASGPLRMP 

FPH/H*NAFLLVFPGQRSQLTS/PSHYLCREVFP 

DHHHHLCRLSLES SPLFHHR VLFC VPKQN VN 

STRAQIFCLF VHI VGCRCINTFPLHLFRLHLW L 

HFLQIPLCKKNKSVKLGKTWGRGCQSAAGS 

DTRVRAAVGAPGLPVEPLV 


796 


2146 


A 


6503 


68 


936 


I IS ALLTHS SFC VFTLCQDFFT YS SMSEEVT Y A 

DHJr QNoMiMnKIJrjblOKr OEKAJ-'rAPSH VWR 

PAALFLTLLCLLLLIGLGVLASMFHVTLK1EM 

KKMNKLQNISEELQRN1SLQLMSNMN1SNKIR 

NLSTTLQTIATKLCRELYSKEQEHKCKPCPRR 

WIWHIO)SCYFLSDDVQTWQESKMACAAQN 

ASLLKJTNNKNALEFIKSQSRSYDYWLGLSPEE 

DS/YSWYESG*YNQ\PSAWVIRNAPDLNNMY 

CGYINRLYVQYYHCTYKQRMICEKMANPVQ 

LGSTYFREA 


797 


2147 


A 


6507 


1 


881 


PGSTHASARSQVPRSAGEAAPHSRRPPGLLPH 

APRAASAQLEERMRDPIIPGMTLQEGDCRGS 

QTVSLTMGTADSDEMAPEAPQHTHIDVHIHQ 

ESALAKLLLTCCSALRPRATQARGSSRLLVAS 

WVMQIVLGILSAVLGGFFYIRDYTLLVTSGA 

AI WTGAVAVLAGAAAFIYEKRGGTYWALLR 

TLLALAAFSTAIAALIO,WNEDFRYGYSYYNS 

ACRISSSSDWNTPAPTQSPEEVRRLHLCTSFM 

DMLKALFRTLQAMLLGVWILLLLASLTPLWL 

/SL/RGECSQPKG*VPKKRDQKEMLEVSGI*PG 

STHASARSQVPRSAGEAAPHSRRPPGLJLPHAP 

V SLTMGTAD SDEMAPE APQHTHI D VHIHQE S 

ALAKLLLTCCSALRPRATQARGSSRLLVASW 

VMQIVLGILSAVLGGFFYIRDYTLLVTSGAAI 

WTGAVAVLAGAAAFIYEKRGGTYWALLRTL 

LALAAFSTAIAALFXWNEDFRYGYSYYNSAC 

RISSSSDWNTPAPTQSPEEVRPXHLCTSFMDM 

LKALFRTLQAMLLGVWILLLLASLTPLWLYC 

WRMFPTKGVSP 


798 


2148 


A 


6528 


912 


2287 


VPNYLPSVSSAIGGEVPQRYVWRFCIGLHSAP 
RFLVAFAYWNFIYLSCTSPCSCYRPLCRJLNFG 
LNWENLALLVLTYVSSSEDF/TWVPG*GRSG 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cystcine s 
D-Aspartic Acid, E=GIutamic Acid, 
F=Phenylalanine, G=Giycine, H=Histidine, 
I=Isoleucine, K=Lysine, 1 .^Leucine, 
M=Methionine, N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threoninc, V=Valine 3 \V=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possibie nucleotide deletion, \=possible 
nucleotide insertion 














EVFPEGTGLPLPHSDLPTSWCGHSLQCGSQSS 

FPPAIHENAFIVFIASSLGHMLLTCILWRLTKK 

HTVSQEXDGLSLAGAPRQPRRKSRTSVLRIRV 

MVRWELS SNGNPGRG VLGLGLGLGNKLR W 

GQNLGL* HC V W V VWETGE* KR WRLQMGIE* 

GVASRRQ*VRNSVRGLVCHNSSAPPMYMGFF 

SPTVFGGGVGG*LHVTFILHPPEVEAAGIPLLL 

GPSLPQRQGREH1VVILAAPACAPFHDR*WEP 

REIRPSP*ELGLRGEPTLSYPASCRVIRQPIP*D 

RKSYSWKQRLFIINF1SFFSALAVYFRHNMYC 

EAG VYTIF AILE YTV VLTNMAFHMTAW WDF 

GNKELUTSQPEEKRF 


799 


2149 


A 


6529 


1 


874 


FFFFQRINFIEHSGSVSLLALACDLGWCEDWS 

CCLVQGGGDLVDVVQTNHGEDEAGGDTDSV 

DEARCKESQQEAQENLREDLCLESFAKDKIL 

QIIEGSEREHEETRTKQAALDGEPLGGGQLTA 

VHLHPSKEQQGQEGGERQRGARTHHWRGW 

EKGRRVRLRPPSGKLRADQPVRKLGGPTPS/T 

ELPG LQPH APTPHT A/P ATPT Y SPAPDTPNPP V 

RWKCPLPVEPRTRQLCRERTRKACPPKPRPPL 

GLPGDPTGPVTHHAPPVSPTGASGQERRAEP 

GAVSYAHASATK 


800 


2150 


A 


6544 


2 


662 


SAQRWAAVAGRWGCRLLALLLLVPGPGGAS 

EITFELPDNAKQCFYEDIAQGTKCTLEFQVITG 

GHYDVDCRLEDPDGKYLYKEMKKQYDSFTF 

TASKNGTYKFCFSNEXFSTFTHKTVYFDFQVG 

E\THLCFLVR/DRVSALTQMESACVSIHEALKS 

VIDYQTHFRLREAQGRSRAEDLNTRVAYWSV 

GEALILLVVS1GQVFLLKSFFSDKRTTTTRVGS 


801 


2151 


A 


6556 


1 


1319 


TPCMECIKGEGLREPQNLSGSQREPQTEGSM 

DGWRJRMPRWGLLLLLWGSCTFGLPTDTTTF 

KRIFLKRMPSIRESLKERGVDMAKLGPEWSQP 

MKRLTLGNTTSSVILTNYMDTQYYGEIGIGTP 

PQTFKWFDTGSSNV WVPS SKCSRLYTACVY 

HKLFDASDSSSYKHNGTELTLRYSTGTVSGFL 

SQDIITVGGITVTQMFGEVTEMPALPFMLAEF 

DGVVGMGFIEQ AIGRVTPIFDNIISQGVLKED 

VFSFYYNRDSENSQSLGGQIVLGGSDPQHYE 

GNFHYINLIKTGVWQTQMKGVSVGSSTLLCE 

DGCLALVDTGASYISGSTSSIEKLMEALGAKE 

KRLFDYVVKCNEGPTLPPTFLFLLGGKDTPLT 

SADYLFQESYSSKKLSTLAIHAMYIPPPTGPTL 

\ALGATF\IRKFYTEFDRGNNPHGFALAR 


802 


2152 


A 


6567 


13 


6147 


MCLGRMGASSPRSPEPVGPPAPGLPFCCGGSL 

LAVWLLALPVAWGQCNAPEWXLPFARPTNL 

TDEFEFPIGTYLNYECRPGY'SGRPFSIICLKNS 

VWTGAKDRCRRKSCRNPPDPVNGMVHVIKG 

IQFGSQIKYSCTKGYRLIGSSSATCIISGDTVIW 

DNETPICDRIPCGLPPTITNGDFISTNRENFHY 

GSVVTYRCNPGSGGRKVFELVGEPSIYCTSND 

DQVGIWSGPAPQCIIPNKCTPPNVENGLLVSD 

xtp^t F9I TMFVVFFRrOPGFVMKGPRRVKCOA 

LNKWEPELPSCSRVCQPPPDVLHAERTQRDK 

DNFSPGQEVFYSCEPGYDLRGAASMRCTPQG 

DWSPAAPTCEVKSCDDFMGQLLNGRVLFPV 

NLQLGAKVDFVCDEGFQLKGSSASYCVLAG 

MESLWNSSVPVCEQIFCPSPPVIPNGRHTGKP 

LEVFPFGKAVNYTCDPHPDRGTSFDLIGESTIR 

CTSDPQGNGVWSSPAPRCGILGHCQAPDHFL 

F AKLKTQTN ASDFPIGTSLKYECRPEYYGRPF 
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SEQ ID 

INC OI 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

JNL/. OI 

peptide 
seq- 
uence 


Met 
nou 


SEQ 

JU INvJ. 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond] 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIanine C=Cysteine, 
D=Aspartic Acid, E=Glutarnic Acid, 
F=Phenyl alanine, G=G!ycine, H-Histidinc, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S-Serine, 
T=Threonine, V= Valine, W=Tryptophan, 
Y=Tyrosinc, X=Unknown, *=Stop codon, 
/—possible nucleotide deletion. \=possible 
nucleotide insertion 














SITCLDNLVWSSPKDVCKRKSCKTPPDPVNG 

MVHVITDIQVGSRFNYSCTTGHRLIGHSSAECI ■ 

LSGNAAHWSTKPPICQRIPCGLPPTIANGDFIS 

TNRENFHYGSVVTYRCNPGSGGRKVFELVGE 

PSIYCTSNDDQVGIWSGPAPQCIIPNKCTPPNV 

ENGILVSDNRSLFSLNEVVEFRCQPGFVMKGP 

RRVKCQAJLNKWEPELPSCSRVCQPPPDVLHA 

ERTQRDKDNFSPGQEVFYSCEPGYDLRGAAS 

MRCTPQGDWSPAAPTCEVKSCDDFMGQLLN 

GRVLFP VNLQLG AK VDF VCDEGFQLKG SS AS 

YC VLAGMESL WNSS VP VCEQIFCPSPP VI PNG 

RHTGKPLEVFPFGKAVNYTCDPHPDRGTSFD 

LIGESTIRCTSDPQGNGVWSSPAPRCGILGHC 

QAPDHFLFAKLKTQTNASDFPIGTSLKYECRP 

EYYGRPFSITCLDNLVWSSPKDVCICRKSCKTP 

PDPVNGMVHVITDIQVGSRrNYSCTTGHRLIG 

HSSAECILSGNTAHWSTKPPICQRIPCGLPPTI 

ANGDFISTNRENFHYGSVVTYRCNLGSRGRK 

VFELVGEPS1YCTSNDDQVGIWSGPAPQCIIPN 

KCTPPNVENGILVSDNRSLFSLNEVVEFRCQP 

GFVMKGPRRVKCQALNKWEPELPSCSRVCQ 

PPPEILHGEHTPSHQDNFSPGQEVFYSCEPGY 

DLRGAASLHCTPQGDWSPEAPRCAVKSCDDF 

LGQLPHGRVLFPLNLQLGAKVSFVCDEGFRL 

KGSSVSHCVLVGMRSLWNNSVPVCEHIFCPN 

PPAJLNGRHTGTPSGDIPYGKEISYTCDPHPDR 

GMTFNLIGESTIRCTSDPHGNGVWSSPAPRCE 

LSVRAGFICKTPEQFPFASPTIPINDFEFPVGTS 

LNYECRPGYFGKMFS1SCLENLVWSSVEDNC 

RRKSCGPPPEPFNGMVHINTDTQFGSTVNYSC 

NEGFRLIGSPSTTCLVSGNNVTWDKKAP1CE1I 

SCEPPPTISNGDFYSKNRTSFHNGTVVTYQCH 

TGPDGEQLFELVGERSIYCTSKDDQVGVWSS 

PPPRC1STNKCTAPEVENAIRVPGNRSFFSLTEI 

IRFRCQPGFVMVGSHTVQCQTNGRWGPKLPH 

CSRVCQPPPEILHGEHTLSHQDNFSPGQEVFY 

SCEPSYDLRGAASLHCTPQGDWSPEAPRCTV 

KSCDDFLGQLPHGRVLLPLNLQLG AK VSF VC 

DEGFRLKGRSASHCVLAGMKALWNSSVPVC 

bt^ll'CPiNrrAJLNOKJi I G 1 PLGDIPYLiKEVSYT 

CDPHPDRGMTFNLIGEST1RRTSEPHGNGVWS 

SP APRCELP VG A A CPHPPKIQNGHYIGGHVSL 

YLPGMTISYTCDPGYLLVGKGFIFCTDQG1WS 

QLDHYCKEVNCSFPLFMNGISKELEMKKVYH 

YGDYVTLKCEDGYTLEGSPWSQCQADDRWD 

PPLAKCTSRTHDALIVGTLSGTIFFILLIIFLSWI 

ILKHRKGNNAHENPKEVAIHLHSQGGSSVHP 

RTLQTNEENSRVLP 


S03 


2153 


A 


6574 


2 


3233 


HGRSARLAAVPAEAMPGPRRPAGSRLRLLLL 
LLLPPLLLLLRG\SHAGNLTVAVVLPL ANTSY 
PWSWA\RVGPAVELALAQVKARPDLLPGWT i 

HNPA VFI XiPGC VYAAAP VGRFTAHWRVPLL 

TAGAPALGFGVKDEYAL'fTRAGPSYAKLGDF 

VAALHRRLGWERQALMLYAYRPGDEEHCFF 

LVEGLFMRVRDRLNITVDHLEF.AEDDLSHYT 

RLLRTMPRKGRVIYICSSPDAFRTLMLLALEA 

GLCGED Y VFFHLDIFGQSLQG G QGP APRRP W 

ERGDGQDVSARQAFQAAKIITYKDPDNPEYL 

EFLKQLKHLAYEQFNFTMEDGLVNTIPASFH 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine s 
D=Aspartic Acid, E=Glutamic Acid, 
FHPhenyialanine, G=Glycine, H==Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methtonine, N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X= Unknown, *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














DGLLLYIQAVTETLAHGGTVTDGEND QRMW 

NRSFQGVTGYLKIDSSGDRETDFSLWDMDPE 

NGAFRVVLNYNGTSQELVAVSGRKLNWPLG 

YPPPDIPKCGFDNEDPACNQDHLSTLEVLALV 

GSLSLLGILIVSFFIYRKMQLEKELASELWRVR 

WEDVEPSSLERHLRSAGSRLTLSGRGSNYGSL 

LTTEGQFQVFAKTAYYKGNLVAVKRVNRKR 

IELTRKVLFELKHMRDVQNEHLTRFVGACTD 

PPNICILTEYCPRGSLQDILENESITLDWMFRY 

SLTNDIVKGMLFLHNGAICSHGNLKSSNCVV 

DGRFVLKITDYGLESFRDLDPEQGHTVYAKK 

LWTAPELLRMASPPVRGSQAGDVYSFGIILQE 

IALRSGVFHVEGLDLSPKEUERVTRGEQPPFR 

PSLALQSHLEELGLLMQRCWAEDPQERPPFQ 

QIRLTLRKFNRENS SNILDNLL SRMEQ Y ANNL 

EELVEERTQAYLEEKRKAEALLYQILPHSVAE 

QLKRGETVQAEAFDSVTIYFSDIVGFTALSAE 

STPMQVVTLLNDLYTCFDAVIDNFDVYKVET 

IGDAYMVVSGLPVRNGRLHACEVARMALAL 

LDAVRSFRIRHRPQEQLRLRJGIHTGPVCAGV 

VGLKMPRYCLFGDTVNTASRMESNGEAL\KI 

HLSS\ETKAVL\EEFGGFELELRGDVEMKGKG 

KVRTY WLLGERG SSTRG 


804 


2154 


A 


6585 


2 


3837 


DAPGRPPVRLPTMELEDGVVYQEEPGGSGAV 

MSERVSGLAGSIYREFERLIVRYDEEVVKELIP 

LVVAVLENLDSVFAQDQEHQVELELLRDDNE 

QLITQYEREKALRKHAEEKFIEFEDSQEQEKK 

DLQTRVESLESQTRQLELKAKNYADQISILEE 

RE AELKKE YN ALHQRHTEMIHN YMEHLERT 

KLHQLSGSDQLESTAHSRIJUCERPISLGIFPLP 

AGDGLLTPDAQKGGETPGSEQWKFQELSQPR 

SHTSLKDELSDVSQGGSKATTPASTANSDVA 

TIPTDTPLKEENEGF VKVTDAPNKSEISKH IE V 

QVAQETRNVSTGSAENEEKSEVQAIIESTPEL 

DMDKDLSGYKGSSTPTKGIENKAFDRNTESL 

FEELS SAGSGLIGDVDEGADLLGMGREVENLI 

LENTQLLETKNALNIVKNDL1AKVDELTCEK 

DVLQGELEAVKQAJCLKLEEKNRELEEELRKA 

RAEAEDARQKAKDDDDSDIPTAQRKRFTRVE 

MARVLMERNQYKERLMELQEAVRWTEMIR 

ASRENPAMQEKKRSSIWQFFSRLFSSSSNTTK 

KPEPPVNI.KYNAPTSHVTPSVKKRSSTLSQLP 

GDICSKAFDFLSEETEASLASRREQKREQYRQ 

VKAm'QKEDGRVQAFGWSLPQKYKQVTNG 

QGENKMKNLPVPVYLRPLDEKDTSMKLWCA 

VGVNLSGGKTRDGGSWGASVFYKDVAGLD 

TEGSKQRSASQSSLDKLDQELKEQQKELKNQ 

EELSSLVWICTSTHSATKVLIIDAVQPGN1LDS 

FTVCN SHVLCIASVPG ARETDYPAGEDLSESG 

QVDKASLCGSMTSNSSAETDSLLGGITVVGC 

SAEGVTGAATSPSTNGASPVMDKPPEMEAEN 

QPvnFMVPTAFF\ATEATEGNAGSAEDTV\DIS 

QTGVYTEHVFTDPLGWQIPEDLSPVYQSSND 

SDAYKDQISVLPNEQDLVREEAQKMSSLLPT 

MWLGAQNGCLYVHSSVAQWRKCLHSIKLKD 

SILSIVHVKGIVLVALADGTLAIFHRGVDGQW 

DLSNYIILLDLGRPiniSlRCMTVVHDKVWCG 

YRNKIYVVQPKAMKIEKSFDAHPRKESQVRQ 

LAWVGDGVWVSIRLDSTLRLYHAHTYQHLQ 

DVDIEPYVSKMLGTGKLGFSFVRITALMVSC 
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SEQ ID 
or 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

l\\J. 01 

peptide 
seq- 
uence 


Met 
noa 


SEQ 

ID Ss\J. 

in 

USSN 
09/496 

Q1 4 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
u— as panic acio, Jt=uiutamic Acid, 
F=Phenylalanine 3 G=Glycine, H^Histidine, 
NIsoleucine, K=Lysine, L=Leucine f 
M=Methionine, N=Asparagine 3 P=Proline, 
i^ = vjiuiamine, tv— Arginine, o— oenne, 
T=Threonine, V=Valine, W=Tryptophan. 
Y=Tyrosine } X=Unknown. *=Stop codon, 
/^possible nucleotide deletion, \=possib!e 
nucleotide insertion 














NRLWVGTGNGV1IS1PLTETVILHQGRLLGLR 
ANKTSGVPGNRPGSV1RVYGDENSDKVTPGT 
FIPYCSMAHAQLCFHGHRDAVKFFVAVPGQV 
ISPQSSSSGTDLTGDKGRGHLHRSLVVRRP 


805 


2155 


A 


6605 


469 


2602 


FGRLLWGTAFKSWKMKAPIPHLILLYATFTQ 

SLKVVTKRGSADGCTDWSIDIKKYQVLVGEP 

VRIKCALFYGYIRTNYSLAQSAGLSLMWYKS 

SGPGDFEEPIAFDGSRMSKEEDSIWFRPTLLQ 

DSGLYACVIRNSTYCMKVSISLTVGENDTGL 

C YN S KMK YFEKAEL SKSKEI S CRDIEDFLLPT 

REPEILWYKECRTKTWRPSIVFKRDTLLIREV 

REDDIGNYTCELKYGGFVVRRTTELTVTAPL 

TDKPPKLLYPMESKLTIQETQLGDSANLTCRA 

FFGYSGDVSPLIYWMKGEKFIEDLDENRVWE 

SD1\K1LKEHLGEQEVSISLIVDSVEEGDLGNYS 

CYVENGNGRRHASVLLHKRELMYTVELAGG 

LGAILLLLVCLVT1YKCYKIEIMLFYRNHFGA 

EELDGDNKDYDAYLSYTKVDPDQWNQETGE 

EEREALEILPDMLEKHYGYKLFIPDRDLIPTGT 

YIEDVARCVDQSKRLIIVMTPWVVRRGWSIF 

ELETRLRNMLVTGE1KVILIECSELRGIMNYQE 

VEALKHTIKLLTVIKWHGPKCNKLNSKFWKR 

LQYEMPFKRIEPITHEQALDVSEQGPFGELQT 

VSA I SMAAATSTAL ATAHPDLRSTFHNTYH S 

QMRQKHYYRSYEYDVPPTGTLPLTSIGNQHT 

YCNIPMTLINGQRPQTKSSREQNPDEAHTNSA 

1LPLLPRETSISSVIW 


806 


2156 


A 


6614 


3 


1584 


NSARGGVGVRGARAMATVQEKAAALNLSAL 

HSPAHRPPGFSVAQKPFGATYVWSSIINTLQT 

QVEVKKRRHIU.KRHNDCFVGSEAVDVIFSHL 

IQNKYFGDVDIPRAKVVRVCQALMDYKVFE 

AVPTKVFGKDI-CKPTFEDSSCSLYRFTTIPNQD 

SQLGKENKLYSPARYADALFK-SSDIRSASLED 

LWENLSLKPANSPHVNISTTLSPQVINEVWQE 

ET1GRLLQLVDLPLLDSLLKQQEAVPKIPQPK 

RQSTMVNSSNYLDRGILKAYSDSQEDEWLSA 

AIDCLEYLPDQMVVEISRSFPEQPDRTDLVKE 

LLFDAIGRYYSSREPLLNHLSDVHNGIAELLV 

NGKTE1ALEATQLLLKLLDFQNREEFRRLLYF 

MAVAANPSEFKLQKJESDNRMVVICRIFSKATV 

DNKNLSKGKTDLLVLFLVMDHQKDVFKIPGT 

L\HKIVS\VK\LMAIQNGRDPNRDAGYIYCQRI 

DQRDYSNITEKTTIDELLYLLKTLDEDSKLSA 

ICEKia<C\LLGQFYKCHPDIFIEI IFGD 


807 


2157 


A 


6615 


4198 


2094 


FGIVGTFALETDELDSDRDPAIFSLCDFGAMR 

PQILLLL AJLLTI .GL AAQHQDK.WCKM/VKML 

CPDRVDKKVSCQVLGLLQVPSVLPPDTETLD 

LSGNQLRSILASPLGFYTALRHLDLSTNEISFL 

QPGAFQALTHLEIILSLAHNRLAMATALSAG 

GLGPLPRVTSLDLSGNSLYSGLLERLLGEAPS 

LHTLSLAENSLTRTTRHTFRDMPALEQLDI.HS 

FSLQQLRVLDLSCNSIEAFQTAS\QPQAEFQLT 

WLDLRENKLLHFPDLAALPRLIYLNLSNNLIR 

LPTGPPQDSKGIHAPSEGWSALPLSVAPSGNAS 

GRPLSQLLNLDLSYNEIEUPDSFLEHLTSLCFL 

NLSRNCLRTFEARRLGSLPCLMIXDLSHNALE 

TLELGARALGVSLRTLLLQGNALRDLPPYTFA 

NLASLQRLNLQGNRVSPCGGPDEPGP\SGCV\ 

AFSGITSLRSLSLVDNEIELLRAGAFLHTPLTE 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIanine OCysteine, 
D=Aspartic Acid, b=vjrlutarnic /\ciu, 
F=PhenyIalanine, G=Glycine ? H=Histidine, 
I=lsolcucinc, K=Lysine, L=Leucine, 
M-Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine s 
T=Threonine, V=Vaiine, W=Tryptophan s 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possiDie nucicuuuc ucicuuii, * ^umiui^ 
nucleotide insertion 














LDLSSNPGLEVATGALGGLEASLEVLALQGN 

GLMVLQVDLPCF1CLKRLNLAENRLSHLPAW 

TQAVSLEVLDLRNNSFSLLPGSAMGGLETSLR 

RLYLQGNPLSCCGNGWLAAQLHQGRVDVDA 

TQDLICRFSSQEEVSLSHVRPEDCEKGGLKNI 

NLIIILTFILVSAILLTTLAACCCVRRQ1CFNQQ 

YKA 


808 


2158 


A 


6619 


153 


1852 


FKALSQYIYTNTHLEREAAFEVAILLRRMEEG 

ARHRNNTEKKHPGGGESDASPEAGSGGGGV 

ALKKEIGLVSACG11VGNIIGSGIFVSPKGVLEN 

AGSVGLAL1VWIVTGF1TVVGALCYAELGVNI 

PKSGGDYFYVKD1FGGLAGFLRLWIAVLVIYP 

TNQAVIALTFSNYVLQPLFPTCFPPESGLRLLA 

AICLLLLTWVNCSSVRWATRVQDIFTAGKLL 

ALALIHMGIVQICKGEYFWLEPKNAFENFQEP 

DIGLVALAFLQGSFAYGGWNFLNYWTEELV 

_^ « rr71 TT , t-it> a it-»TC<TT»\T "v /TT\ /VX/Tr A "\T\/ /AT "VWT 1 

DP\YKNL\PRAIFIS1P\LV irVY VrANV/AL i v i 

AMSPQELVLASVNAVAVTFGEKLLGVMAWIM 

PISVALSTFGGVNGSLFTSSRLFFAGAREGHLP 

SVLAM1HVKRCTP1PALLFTCISTLLMLVTSD 

MYTLINYVGFINYLFYGVTVAGQIVLRWKKP 

DIPRPIKTNLLFP11YLLFWAFLLVFSLWSEPVV 

CGIGLA1MLTGVPVYFLGVYWQHKPKCFSDFI 

ELLTLVSQKMCVVVYPEVERGSGTEEANED 

MEEQQQPMYQPTPTKDKDVAGQPQP 


809 


2159 


A 


6621 


1041 


223 


QD S RKMLP S I S V N S JL V Q U N U V L, N a KDAAKH 

TAGAKRYKYLRRLFRFRQMDFEFAAWQMLY 

LFTSPQRVYRNFHYRICQTKDQWARDDPAFL 

VLLS1WLCVST1GFGFVLDMGFFETIKLLLWV 

VLIDCVGVGLL1ATLMWFISNKYLVICRQSRD 

YDVEWGYAFDVHLNAFYPLLVILHFIQLFF1N 

HVILTDTFIGYLVGNTLWLVAVGYYIYVTFL 

GYSVGLLFFSVALPFLKNTVILLYPFAPLILLYG 

LSLALGWNr irll LCbr iK) Kviv 


810 


2160 


A 


6623 


160 


822 


SPASGHCRLNGAAVAMFGCLVAGRLVQTAA 
QQVAEDKFVFDLPDYESINHVVVFMLGTIPFP 
EGMGGSVYFSYPDSNGMPVWQLLGFVTNGK 
PSAIFKISGLKSGEGSQHPFGAMNIVRTPSVAQ 
IGISVELLDSMAQQTPVGNAAVSSVDSFTQFT 
QKMLDNFYNFASSFAVSQ/VPDDTQ/RPSEMF 
IPANVVLKWYENFQRRTSTEPSLLENIIW1KIN 
F 


811 


2161 


A 


6627 


18 


3367 


LEG SLNTERAKY YLTITMPHFTVTK VEDPEEG 
AAASISQEPSLADIKARIQDSDEPDLSQNSITG 
EHSQLLDDGHKKARNAYLNNSNYEEGDEYF 
DKNLALFEEEMDTRPKVSSLLNRMANYTNLT 
QG AKEHEE AENITEGKKXPTKTPQMGTFMG 
VYLPCLQNIFGVILFLRLTWVVGTAGVLQAF 
AIVLICCCCTMLTAISMSAIATNGWPAGGSY 
FM1SRALGPEFGGAVGLCFYLGTTFAAAMYIL 
GAIEIFLVYIVPRAAIFHSDDALKESAAMLNN 
MRVYGTAFLVLMVLVVFIGVRYVNKFASLFL 
ACVIVSILAIYAGAIKSSFAPPHFPVCMLGNRT 
LSSRHIDVCSKTKEINNMTVPSKLWGFFCNSS 
QFFNATCDEYFVHNN\ / TSIQGIPGLASG1ITEN 
L WSNYLPKGEUEKPS AKSSD VLG SLNHEYVL 
VDITTSFTLLVGIFFPSVTG1MAGSNRSGDLKD 
AQKSIPIGT1LAILTTSFVYLSNVVT.FGACTEGV 
VLRDKFGDAVKGNLVVGTLSWPSPWVIV1GS 
1 FFS TCGAGLQSLTG APRLLQA1AKDNIIPFLRV 
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SEQ no 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

becinnine 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, 
D— Aspartic Acid, E^GIutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isolcucinc, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














FGHSKANGEPTWALLLTAAIAELGIL1ASLDL 

VAPILSMFFLMCYLFVNLACALQTLLRTPNW 

RPRFR Y YH W AL SFMGMSICL ALMFISS WYYA 

1VAMVIAGMIYKYIEYQGAEKEWGDGIRGLS 

LSAARFALLRLEEGPPHTKNWRPQLLVLLKL 

DEDLHVKHPRLLTFASQLKAGKGLTIVGSVIV 

GNFLENYGEALAAEQTIKHLMEAEK VKGFCQ 

LV V AAKLREGI SHLIQSCGLGGMKHNTV VM 

GWPNGWRQSEDARAWKTFIGTVRVTTAAHL 

ALLVAKNISFFPWVFOF^FnNTDYWWTVHDn 

GMLMLLPFLLK\QHKVWRKCSIRFF\TVAQLE 

DNSIQMKKDLATFLYHLRIEAEVEVVEMHDS 

DTS A YT YERTLMMEQRSQMLR H MRLSKTER 

DREAQLVKJDRNSMLRLTSIGSDEDEETETYQ 

EKVHMTWTKX)KYN4ASRGQKAKSMEGFQDL 

LNMRPDQSNVRRMHTAVKLNEVIVNKSHEA 

KLVLLNMPGPPRNPEGDFNYMEFI FVI TFG1 

ERVLLVRGGGSEVITIYS 


812 


2162 


A 


6628 


66 


640 


AVCTMSEMAELSELYEESSDLQMDVMPGEG 

DLPQMEVGSGSRELSLRPSRSGAQQLEEEGP 

MEEEEAQPMAAPEGKRSLANGPNAGEQPGQ 

VAGADFESEDEGEEFDDWEDDYDYPEEEQLS 

GAGYRVSAALEEADKMFLRTREPALDGGFQ 

MH YEKTPFDQL AFIEELF\SL M V VNRLTEE LG 

CDEIIDRE 


813 


2163 


A 


6630 


708 


1355 


AKMGAYKYIQELWRKKQSDVMRFLLRVRC 

WQYRQLSALHRAPRPTRPDKARRLGYKAKQ 

GY/VYIYIGFVFAVIYRIRVRRGGRKRPVPKG 

ATYGKPVHHGVNQLKFARSLQSVAEERAGR 

HCGALRVLNSYWVGEDSTYKFFEVILIDPFHK 

AIRRNPDTQWITICPVHKHREMRGLTSAGRKS 

RGLGKGHKFHHTIGGSRRAAWRRRNTLQLH 

RYR 


814 


2164 


A 


6635 


201 


1705 


KGTEMNKSRWQSRRRHGRRSHQQNPWFRLR 

DSEDRSDSRAAQPAHDSGHGDDESPSTSSGT 

AGTSSVPELPGFYFDPEKKRYFRLLPGHNNCN 

PLTKESIRQK£MESKRLRLLQEEDRI<KKIARM 

GFNASSMLRKSQLGFLNVTNYCHLAHELRJLS 

CMERKJCVQIRSMDPSALASDRFNLILADTNS 

DRLFTVNDVTVGGSKYGIINLQSLKTPTLKVF 

MHENLYFTNRKVXNSVCWASLNHLDSHILLC 

LMGLAETPGCAT1XPASLFVNSHPAGIDRPG\ 

MLCSFRIPG AW SC A WSLN 1QANNCFSTGLSR 

RVLLTNVVTGHRQSFGTNSDVLAQQFALMA 

PLLFNGCRSGEIFAIDLRCGNQGKGWKATRLF 

HDSAVTSVRILQDEQYLMASDMAGKIKLWD 

LRTTKC VRQYEGHVNE Y A YLPLH VHEEEGTT , 

VAVGQDCYTRIWSLHDARLLRTIPSPYPASKA 

DIPSVAFSSI^LGGSRGAPGLLIVIAVGQDLYCY 

SYS 


815 


2165 


A 


6643 


659 


3282 


NKNILEVPSARTTRIMGDHLDLLLGWLMAG 

PVFGIPSCSFDGRIAFYRFCNLTQVPQVLNTTE 

RLLLSFNYIRTVTASSFPFLEQLQLLELGSQYT 

PLTIDKEAFRNLPNLR1LDLGSSKJYFLHPDAP 

QGLFHLFELRLYFCGLSDAVLKDGYFRNLKA 

LTRLDLSKNQI RSL YLHPSFGKLN SLKSIDFS S 

NQIFLVCEHELEPLQGKTLSFFSLAANSLYSR 

VSVDWGKCMNPFRNMVLEILDVSGNGWTV 

D1TGNFSNAISKSQAFSLILAHHIMGAGFGFHN 

IKDPDQNTF'AGLARSSVRHLDLSHGFVFSLNS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Aminoacid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid. E=Glutarnic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine s L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine 3 V= Valine, W=Tryptophan. 
Y=Tyrosine, X^Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














RVFETLKDLKVLNLAYNKJNKIADEAFYGLD 

NLQVLNLS YNLLGEL Y S SNFYGLPK V A Y I DL 

QKNHIAIIQDQTFKFLEKXQTLDLRDNALTTIH 

FIPS1PDIFLSGNKLVTLPKINLTANLIHLSENR 

LENLDILYFLLRVPHLQILILNQNRFSSCSGDQ 

TPSENPSLEQLFLGENMLQLAWETELCWDVF 

EGLSHLQVLYLNHNYLNSLPPGVFSHLTALR 

GLSLNSNRLTVLSHNDLPANLEILDISRNQLL 

APNPDVFVSLSVLDITHNKFICECELSTF1NWL 

NHTNVTlAGPPADIYCVYPDSLSGVSLFSLSTb 

GCDEEEV1 XSLKFSLFIVCTVTLTLFLMTILTV 

TKFRGFCFICYKTAQRLWKDHPQGTEPDMY 

KYDAYLCFSSKDFTWVQNALLKHLDTQYSD 

QNRFNLCFEERDFVPGENRPXANIQDAIWNSR 

KIVCLVSRHFLRDGWCLEAFSYAQGRCLSDL 

NSALIMWVGSLSQYQLMKHQSIRGFVQKQQ 

YLRWPEDLQDVGWFLHKLSQQILKKEKEKK 

KDNMPLQTVATIS 


816 


2166 


A 


6646 


1 


3811 


RDRAG VRPAGKQHAA/VAF YD VG GDRP WDS 

GNTQLPPRNPVKANAMFGAGDEDDTDFLSPS 

GGARLASLFGLDQAAAGHGNEFFQYTAPKQP 

KKGQGTAATGNQATPKTAPATMSTPT1LVAT 

AVHAYRYTNGQYVKQGKFGAAVLGNHTTR 

EYRILLYISQQQPVTVARIHVNFELMVRPNNY 

STFYDDQRQNWSIMFESEKAAVEFNKQVCIA 

KCNSTSSLDAVLSQDLIVADGPAVEVGDSLE 

VAYTGWLFQNHVLGQVFD STANKDICLLRLK 

LGSGKVIKGWEDGMLGMKKGGKRLLIVPPA 

CAVGSEGVIGWTQATDSILVFEVEVRRVKIA 

KDSGSDGHSVSSRDSAAPSPIPGADNLSADPV 

VSPPTSIPFKSGEPALRTKSNSLSEQLAINTSPD 

AVKAKL1SRMAKMGQPMLPILPPQLDSNDSEI 

EDVNTLQGGGQPVVTPSVQPSLQPAHPALPQ 

MTSQAPQPSVTGLQAPSAALMQVSSLDSHSA 

VSGNAQSFQPYAGMQAYAYPQASAVTSQLQ 

PVRPLYPAPLSQPPHFQGSGDMASFLMTEAR 

QHNTEIRMAVSKVADKMDHLMTKVEELQKH 

SAGNSMLIPSMSVTMETSMIMSNIQRIIQENER 

LKQE1LEKSNRIEEQNDKISELIERNQRYVEQS 

NLMMEKRNNSLQTATENTQARVLHAEQEKA 

KVTEELAAATAQVSHLQLKMTAHQKKETEL 

QMQLTESLKETDLLRGQLTKVQAKLSELQET 

SEQAQSKFKSEKQNRKQLELKVTSLEEELTDL 

RVEKESLEKNLSERKKKSAQERSQAEEEIDEI 

RKSYQEELDKXRQLLKKTRVSTDQAAAEQLS 

LVQAELQTQWEAKCEHLLASAKDEHLQQYQ 

EVCAQRDAYQQKLVQLQEKSVCFA\CLALQA 

QITALTKQNEQH1RELEKNKSQMSGVEAAAS 

DPSEKVKKIMNQVFQSLRREFELEESYNGRTI 

LGTIMNTIKMVTLQLLNQQEQEKEESSSEEEE 

EKAEERPRRPSQEQSASASSGQPQAPLNRERP 

FSPMVPSEOVVEEAVPLPPOALTTSQDGHRR 

KGDSEAEALSEIKDGSLPPELSCIPSHRVLGPP 

TS iPPEPLGP V SMDSECEESL AASPMAAKXPDN 

P SGKWCVREV APDGPLQES STRLSLTSXDPEE 

GDPLALGPESPGEPQPPQLKICDDVTSSTGPHK 

ELSSTEAGSTVAGAALRPSHHSQRSSLSGDEE 

DELFKGATLKALRPKAQPEEEDEDE V SMKGR 

PPPTPLFGDDDDDDDLDWLG 


817 


2167 


A 


6649 


63 


1073 


FFRSSSDNGSPIRQYE/HSTPAHQGPVMGLEG 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
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Met 
hod 


SEQ 

in no- 
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in 
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Predicted 

nAfrinntnn 

ucgi mi iii^ 

nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine C=Cysteine, 
0 = Aspartic Acid. E^Glutamic Acid, 
P=PhenylaIanine, G=G!ycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, W=Tryptophan, 
Y=^Tyrosine, X=Unknown, *=Stop codon, 
/=possib!e nucleotide deletion, \=possible 
nucleotide insertion 














KS/ARNSQLRIYLVGKTGAGKSATGNSILGRK 
vrriju l /n_/\Jvol livis.UclvKo3o WiVjk i xii-» V V VLs 

TPGIFDTEWNAETSKEIIRCILLTSPGPHALLL 

WPLGRYTEEEHKATEIGLKMFGERARSFMIL 

IFTRKDDLGDTNLHDYLREAPEDIQDLMDIFG 

DRYCALNNKATGAEQEAQRAQLrXirJQRW 

RENKEGCYTNRaMYQRAEEEIQKQTQAMQEL 

HRVELEREKARIREEYEEKIRKLEDK\^EQEKR 

KKQMEKKLAEQEAHYAVRQQRARTEVESKD 

GILELIMTAJLQ1ASFILLRLFAED 


S18 


2168 


A 


6660 


357 


1890 


APSGSWTRVVLTLDPCSLRSRSPRSLLDPGMP 

GISARGLSHEGRKQLAVNLTRVLALYRSILDA 

YIIEFF\TDNL WDTLPCS WQE ALDGLKPPQL A 

TMLLGMPGEGEVVRYRSVWPLTLLALKSTA 

CALAFTRMPGFQTPSEFLENPSQSSRLTAPFR 

KHVRPKKQHEIRRLGELVKKLSDFT/GLHPGC 

RRGLRPGVHLSRFMALGLGLMVKSIEGDQRL 

VERAQRLDQELLQALEKEEKRNPQVVQTSPR 

HbFliri V VKW vL>l J J ALClir,LLLrLbNPCQGRA 

RLLLTGLHACGVDLSVALLRHFSCCPEVVALA 

SVGCCYMKLSDPGGYPLSQWVAGLPGYELP 

YRLREGACHALEEYAERLQKAGPGLRTHCY 

RAALEWIRRARPELRRPGVQGIPRVHELKIEE 

YVQRGLQRVGLDPQLPLNLAALQAHLAQEN 

RVVA1-FSLALLLAPLVETL1LLDRLLYLQEQA 

LSP\GFHAELLPIFSPELSPRNLVLVATKMPLG 

QALSVLETEDS 


819 


2169 


A 


6661 


65 


2686 


SGSGHCLAEAASMGPWGWKLRWTVALLLA 

AAGTAVGDRCERNEFQCQDGICCISYKWVCD 

GSAECQDGSDESQETCLSVTCKSGDFSCGGR 

VNRCIPQF WRCDGQ VDCDN G S DEQG CPPKTC 

SQDEFRCHDGKCISRQFVCDSDRDCLDGSDE 

ASCPVLTCGPASFQCNSSTCIPQLWACDNDPD 

CEDGSDEWPQRCRGLYVFQGDSSPCSAFEFH 

CLSGECIHSSWRCDGGPDCKDKSDEENCAVA 

TCRPDEFQCSDGNCIHGSRQCDREYDCKDMS 

DEVGCVNVTLCEGPNKFKCHSGECITLDKVC 

NMARDCRDWSDEPIKECGTNECLDNNGGCS 

HVCNDLICIGYECLCPDGFQLVAQRRCEDIDE 

CQDPDTCSQLCVNLEGGYKCQCEEGFQLDPH 

TKACKAVGSIAYLFFTNRHEVRKMTLDRSEY 

TSL1PNLRNVVALDTEVASNRIYWSDLSQRMI 

CSTQLDRAHGVSSYDTVISRDIQAPDGLAVD 

WIHSNIYWTDSVLGTVSVADTKGVKRKTLFR 

ENGSICPRAI\^\nDPVHGFMYWTDWGTPAKIK 

KGGLNGVDIYSLVTENIQWPNGITLDLLSGRL 

YWVDSKLHSISSIDWGGNRKTILEDEFCRLAH 

rroLA Vr.bLJJ\. V r W I UlINJiAlroAiNK-Ll ubUV 

NLLAENLLSPEDMVLFHNLTQPRGVNWCERT 

TLSNGGCQYLCLPAPQINPHSPKFTCACPDGM 

LLAR\DMRSCLTEG\EAAVATQETSTVRLKVS 

STAVRTOHTTTRPVPDTSRI PGATPGI TTVF1 

VTMSHQAI.GDVAG\RGN\EKJCPSSVRALSfVL 

PIVU.L VFLCLG VFLL WKN WRLKN1N SINFDNP 

VYQKTTEDEVHICHNQDGYSYPSRQMVSLED 

DVA 


820 


2170 


A 


6666 


17 


4146 


ERGISSQIKGMKSGSGGGSPTSLWGLLFLSAA 
LSLWPTSGEICGPGIDIRNDYQQLKRLENCTVT 
EGYLHILLISKAEDYRSYRFPKLTVITEYLLLF 
RVAGLESLGDLFPNLTVIRGWKLFYNYALVIF 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
jj—Aspartic /yciq, tj— vjiuidiiiiv. ayi,iu, 
F=Phenyialanine, G=G lycine, H=Histidinc, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine ? P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unkno\vn, *=Stop codon, 
/-possible nucleotide deletion, \=possible 
nucleotide insertion 














EMTNLKDIGLYNLRNITRG\AIRIEKNADLCYL 

ST VD W SL1LD A V SNN YI VGNKPPKECGDLCP 

GTMEEKPMCEKTTINNEYNYRCWTTNRCQK 

MCPSTCGKRACTENNECCHPECLGSCSAPDN 

DTACVACRHYYYAGVCVPACPPNTYRFEGW 

RCVDRDFC ANILS AESSDSEGFVIHDGECMQE 

CPSGF1RN GSQSM YCIPCEGPCPK VCEEEKKT 

KTIDSVTSAQMLQGCTIF1CGNLLINIRRGNNIA 

SELENFMGLIEVVTGYVKIRHSHALVSLSFLK 

NLRLILGEEQLEGNYSFYVLDNQNLQQLWD 

WDHRNLTIKAGKMYFAFNPKLCVSEIYRMEE 

VTGTKGRQSKGDINTRNNGERASCESDVLHF 

TSTTTSKNRIIITWHRYRPPDYRDLISFTVYYK 

EAPFKNVTEYDGQDACGSNSWNMVDVDLPP 

NKDVEPGILLHGLKPWTQYAVYVKAVTLTM 

VENDHIRGAKSEILYIRTNASVPS1PLDVLSAS 

NSSSQLIVKWNPPSLPNGNLSYYIVRWQRQP 

QDGYLYRHNYCSKDKIPIRKYADGTIDIEEVT 

ENPKTEVCGGEKGPCCACPKTEAEKQAEKEE 

AE Y RKVFENFLHN S1F VPRPERKRRD VMQ V A 

NTTMSSRSRNTTAADTYNITDPEELETEYPFF 

ESRVDNKERTVISNLRPFTLYRIDIHSCNHEAE 

KLGCSASNFVFARTMPAEGADDIPGPVTWEP 

RPENSIFLKWPEPENPNGLILMYEIKYGSQVE 

DQR EC V SRQE YRKYGG AKLNRLNPGN YTARI 

Q ATSLSGNG S WTDP VFFYVQ AKRYENFIHLII 

ALPVAVLLIVGGLVIMLYVFHRKRNNSRLGN 

GVLYASVNPEYFSAADVYVPDEWEVAREKIT 

MSRELGQGSFGMVYEGVAKGVVKDEPETRV 

AIKTVNEAASMRERIEFLNEASVMKEFNCHH 

WRLLGVVSQGQPTLVIMELMTRGDLKSYLR 

AYLNANKFVHRDLAARNCMVAEDFTVKIGD 

FGMTRDIYETDYYRKGGKGLLPVRWMSPESL 

KDGVFTTYSDVWSFGVVLWEIATLAEQPYQ 

GLSNEQVLRFVXMEGGLLDKPDNCPDMLFEL 

MRMCWQYNPKMRPSFLEIISSIKEEMEPGFRE 

VSFYYSEENICLPEPEELDLEPENMESVPLDPS 

ASSSSLPLPDRHSGHKAENGPGPGVLVLRASF 

npD opva WIUNHGRKNERALPLPOSSTC 


821 


2171 


A 


6691 


106 


825 


GRVLFRGCGVGHKGQVLMGTFILAQDWLSE 

SNHVFCVSSMLRLQKRLASSVLRCGKKKVW 

LDPNETNEIANANSRQQIRKLIKDGLIIRKPVT 

VHSRARCRKNTLARRK.GRHMGIGKRKGTAN 

ARA1PEKVTWMRRN1RILRRLLRRYRES/KRYR 

ESICK1DRJ-IMYHSLYLKVKGNVFKNKRILMEH 

IHFXKADKARKJKJLLADQAEARRSKTKEARK 


822 


2172 


A 


6715 


772 


21 


" DFRPGLLLPRKKKMFGFHKPKMYRSIEGaCI 
S G AKSS S S\RFTDSKRYEK\DFQ\SCFGLHETR\ 
S GDI\CN A\CVLL\LKR WKKLPAG SKK\N WNH 
VVDARAGPSVLKTTLKPKKVKTL\SGNRIK\ST 
QISKLQKEFKR\HNSDAHSTTS\SASP\AQSPLF 
TVNQFRWTGSDTGVGFPGSNRNHPVFSFLDLX 
TYWKRQK1CCG1MYKGRFGEVLIDTHLFKPCC 
SNKKAXAAEKPEEQGPEPLPISTQEWVTEVFM 


823 


2173 


A 


6727 


3 


4063 


' PYLATLQLDSSLLIPPKYQTPPAAAQGQATPG 
NAGPLAPNGSAAPPAGSAFNPTSNSSSTNPAA 
SSSASGSSVPPVSSSASAPGISQISTTSSSGFSGS 
VGGQNPSTGGISADRTQGNIGCGGDTDPGQS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteinc J 
D=Aspartic Acid, E-Glutamic Acid, 
F=Phenyla!anine 3 G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M^Methionine, N=Asparagine. P=Proline, 
Q=G I utamine, R-Arginine, S^Serine, 
T^Threonine, V^Valine, W^Tryptophan, 
Y=Tyrosine, X=Unknown. *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














SSQPSQDGQESNVPSVGSLADPDYLNTPQMN 

TPVTLNSAAPASNSGAGVLPSPATPRFSVPTP 

RTPRTPRTPRGGGTASGQGSVICYDSTDQGSP 

ASTPSTTRPLNSVEPATMQPIPEAHSLYVTLIL 

SDSVMN1FKDRNFDSCC1CACNMNIKGADVG 

LYIPDSSNEDQYRCTCGFSA1MNRKLGYNSGL 

FLEDELD1FGKNSDIGQAAERRLM\MCQSTFL 

PQ VEGTKKPQEPP1 SLLLLLQNQHTQPF ASLN 

FLDY1SSNNRQTLPCVSWSYDRVQADNNDY 

WTECFNALEQGRQYVDNPTGGKVDEALVRS 

ATVHSWPHSNVLDISMLSSQDWRMLLSLQP 

FLQDAIQK1CRTGRTWEN1QHVQGPLTWQQFH 

KMAGRGTYGSEESPEPLPIPTLLVGYDKDFLT 

ISPFSLPFWERLLLDPYGGHRDVAYIVVCPEN 

EALLEGAKTFFRDLSAVYEMCRLGQHKPICK 

VLRDGIMRVGKTVAQKLTDELVSEWFNQPW 

SGEENDNHSRI.KLYAQVCRHHLAPYLATLQL 

DSSLLEPPKYQTPPAAAQGQATPGNAGPLAPN 

GSAAPPAGSAFNPTSNSSSTNPAASSSASGSSV 

PPVSSSASAPGISQISTTSSSGFSGSVGGQNPST 

GGISADRTQGNIGCGGDTDPGQSSSQPSQDG 

QESVTERERIGIPTEPDSADSHAHPPAVVIYM 

VDPFTYAAEEDSTSGNFWLLSLMRCYTEMLD 

NLPEHMRNSFILQIVPCQYMLQTMKDEQVFY 

1QYLKSMAFSVYCQCRRPLPTQIHIKSLTGFGP 

AASIEMTLKNPERPSP1QLYSPPFILAPIKDKQT 

ELGETFGEASQKYNVLFVGYCLSHDQRWLL 

ASCTDLHGELLETCVVNIALPNRSRRSKVSAR 

K1GLQKLWEWCIGIVQMTSLPWRWIGRLGR 

LGHGELKDWSILLGECSLQTISKKXKDVCRM 

CGISAADSPS1LSACLVAMEPQGSFWMPDAV 

TMGSVFGRSTALNMQSSQLNTPQDASCTHIL 

VFPTSSTIQVAPANYPNEDGFSPNNDDMFVDL 

PFPDDMDNDIGILMTGNLHSSPNS SP VPSPGSP 

SGIGVGSHFQHSRSQGERLLSREAPEELKQQP 

LALGYFVSTAKAENLPQWFWSSCPQAQNNQC 

PLFLKASLHHHISVAQTDELLPARNSQRVPHP 

LDSKTTSD VLRFVLEQ YN ALS WLTCNPATQD 

RTSCLP VI IF V VLTQL YN AIMNIL 


824 


2174 


A 


6732 


2440 


365 


VEEGLGRRRTPPGGRRGPVTPARPGPDSVRR 

RLLPPSSAAAFSSHRHNLLCSRRRGGGGGGG 

GGGGG'niCRPGlTGPTAATSPSGEPGNAASAP 

LSLLSPFPGQTTYQHPGVAEPSAYGGRDVAC 

ASLVFGRLQHRGGDRKRGLLGRSSGDAASD 

QPFRCRSGSTAGRLVKQMDFTEAYADTCSTV 

GLAAREGNVKVLRKLLKKGRSVDVADNRG 

WMPIHEAAYHNSVECLQMLINADSSENYIKM 

KTFEGFCALHLAASQGHWK.IVQILLEAGADP 

NATTLEETTPLFLAVENGQIDVLRLLLQHGAN 

VNGSHSMCGWNSLHQASFQENAEIIKLLLRK 

GANKECQDDFGITPLFVAAQYG\KLESL\SILIS 

SGVANVNCQALDKATPI.FIAAQEGHTKCVELL 

LSSGADPDLYCNEDSWQLPItiAAAQMGHTKJ 

LDLLIPLTNRACDTGLNKVSPVYSAVFGGHE 

DCLEILLRNGYSPDAQACLVFGFSSPVCMAFQ 

KDCEFFGIVNILLKYGAQINELHLAYCLKYEK 

FSIFRYFLRKGCSLGPWNHIYEFVNHAIKAQA 

KYKEWLPHLLVAGFDPLTLIXNSWiDSVSlDT 

LIFTLEFTNWKTLAPAVERMLSARASNAWIL i 

QQHIATVPSLTHLCRLEIRSSLKSERLRSDSYIS 
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SEQ in 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ TD 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SHQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C^Cysteine, 

t> a ennrtif A r-iA rr — f^] i itcim ip Afirl 

u — A.spanic avciu, c — uniiduu^ 

F=Phenylalanine 5 G=Glycine, H=Histidine, 

I — Tcr\lf»tir»inr» TsT=T vQinr 1 — T Biicinc 

M=Methionine, N=Asparagine, P=Proline, 
QOlutamine, R=Arginine, S=Serine, 
T=Threonine, V=V aline, W=Tryptophan, 
Y=Tyrosine, X=Unknown 5 *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














QLPLPRSLHNYLLYEDVLRMYEVPELAA1QD 
G 


825 


2175 


A 


6735 


111 


1252 


RIMGLFDRGVQMLLTTVGAFAAFSLMTIAVG 
TDYWLYSRGVCKTKSVSENETSKKNEEVMT 

ADTAEYFLRAVRASSIFPILSVILLFMGGLCIA 

ASEFYKTRHNIILSAG1FFVSAGLSN1IGIIVYIS 

ANAGDPSKSDSKKNSYSYGWSFYFGALSF11A 

EMVGVLAVHMFIDRHKQLRATARA\TDYLQ 

ASAITRIPSYRYRYQRRSRSSSRSTEPSHSRDA 

SPVGIKGFNTLPSTEISMYTLSRDPLKAATTPT 

ATYNSDRDNSFLQVHNCIQKENKDSLHSNTA 

NRRTTPV 


826 


2176 


A 


6744 


3 


5177 


SDDLRTGLFQDVQDAESLKLPGVYEVLFYNE 

TEDCPGMMLWRYPEPRGLTLVRITPVPFNTT 

EDPDISTADLGDVLQDPCSLEYWDELQKVFV 

AFREFNLSESKVCELQLPDINLVNDQKKLVSS 

DLWRIVLNSSQNGADDQSSASESGSQSTCDPL 

VTPTALAACTRVDSCFTPWFVPSLCVSFQFAH 

LEFHLCHHLDQLGTAAPQYLQPFVSDRNMPS 

ELEYMIVSFREPHMYLRQWNNGSVCQEIQFL 

AQADCIO.LECRNVTMQSVVICPFSIFGQMAVS 

SDVVEKLLDCTVIVDSVFVNLGQHVVHSLNT 

AIQAWQQNKCPEVEELVFSHFV1CNDTQETL 

RFGQVDTDENILLASLHSHQYSWRSHKSPQL 

LH1CIEGWGNWRWSEPFSVDHAGTFIRTIQYR 

GRTASLIIKVQQLNGVQKQIIICGRQI1CSYLSQ 

S1ELKVVQHYIGQDGQAVVREHFDCLTAKQK 

LPSYILENNELTELCVKAKGDEDWSRDVCLE 

SKAPEYSIVIQVPSSNSSIIYVWCTVLTLEPNS 

QVQQRMIVFSPLFIMRSHLPDPIIIHLEKRSLGL 

SETQIIPGKGQEKPLQN1EPDLVHHLTFQAREE 

YDPSDCAVPISTSUKQIATKVHPGGTVNQILD 

EFYGPEKSLQPIWPYNKKDSDRNEQLSQWDS 

PMRVKLS1WKPYVRTLLIELLPWALLINESKW 

DLWLFEGEKIVLQVPAGKIIIPPNFQEAFQ1GIY 

WAlsTTNTVHKSVAiKLVHNLTSPKWKDGGNG 

EVVTLDEEAFVDTEIRLGAFPGHQKLCQFCIS 

SMVQQGIQIIQIEDKTTIINNTPYQIFYKPQLSV 

CNPHSGKEYFRVPDSATFS1CPGGEQPAMKSS 

SLPCWDLMPDISQSVLDASLLQKQIMLGFSPA 

PGADSS QC WSLPAIVRPEFPRQSVAVPLGNFR 

ENGFCTRA1 VI TYQEHLG VTYLTLSEDPSPRV 

IIHNRCPVKMLIKEMKD1PKFEVYCKK1PSECS 

IHHELYHQISSYPDCKTKDLLPSLLLRVEPLDE 

VTTEWSDAIDINSQGTQYVFLTGFGYVYVDV 

VHQCGTVFITVAPEGKAGPILTNTNRAPEKIV 

TF/KMFITQLSLAWDDLTHHKASAELLRLTL 

DNIFLCVAPGAGPLPGEEPVAALFELYCVEIC 

CGDLQLDNQLYNKSNFHFAVLVCQGEKAEPI 

QCSKMQSLLISNKELEEYKEKCF1KLCITLNEG 

KSILCDINEFSFELKPARLYVEDTFVYYIKTLF 

DTYLPNSRLAGHSTHLSGGKQVLPMQVTQH 

ARALVNPVKLRKLVIQPVNLLVSIHASLKI.YI 

ASDHTPLSFSVFERGPIFTTARQLVHALAMHY 

AAGALFRAGWWGSLDILGSPASLVRSIGNG 

VADFFRLPYEGLTRGPGAFVSGVSRGTTSFVK 

HISKGTLTSITNLATSLARNMDRLSLDEEHYN 

r.QEEWRRQLPESLGEGLRQGLSRLGISLLGAI 

AGIVDQPMQNFQKTSEAQASAGHKAKGVISG 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 

IIUtlCULlUC 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

F=Phenylalanine, G=Glycine, H=Histidine, 
I=Iso leucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine ? R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown. *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














VGKGIMGVFTKPIGGAAELVSQTGYGILHGA 

GLSQLPKQRHQPSDVVHADQAPNSHVKYVW 

KMLQSLGRPEVHMALDWLVRGSGQEHEGC 

LLLTSEVLFWSVSEDTQQQAFPVTEIDCAQD 

SKQNNLLTVQLKQPRVACDVEVDGVRERLSE 

QQYNRLVDYITKTSCHLAPSCSSMQIPCPWA 

AEPPPSTVKTYHYLVDPHFAQVFLSKFTIVfVK 

NKALRKGFP 


827 


2177 


A 


6748 


2 


1662 


FVGAPRRGNPFGSPGNPGRHQGPCHRPRGTK 

ASGVSPTLWRPQAAATGLEMPSSGRALLDSP 

LDSG SLTSLD S S VFC S EGEGEPL ALGDCFT VN 

VGGSRFVLSQQALSCFPHTRLGKLAVVVASY 

RRPGALAAVPSPLELCDDANPVDNEYFFDRS 

SQAFRYVLHYYRTGRLHVMEQLCALSFLQEI 

QYWGEDELSIDSCCRDRYFRRKELSETLDFKK 

DTEDQESQHESEQDFSQGPCPTVRQKLWNIL 

EI<PG SSTAARIFG VISIIFVG VSIINMALMSAEL 

RCRFLRKVPNTIDLLAn.PFYITLI.VESLSG\SQT 
TQEIAEN VGAHCPGCLRLLKALXRMLKA WGR 
HSTGLRSLGMTITQCYEEVGLLLLFLSVGISIF 
STVEYFAEQSIPDTTFTSVPCAWWWATTSMT 
TVG YGD1RPDTTTGKI V AFMCILSGIL VLALP1 
AIINDRFSACYFTLKLKE^VRQREALKKLTK 
NIATDSYISVNLRDVYARSIMEMLRLKGRER 
ASTRSSGGDDFWF 


828 


2178 


A 


67S6 


5672 


1360 


GTHP AS SGP VPLPPAA VS AATREE LGEP VPF V 

TASSGFQSMHSSNPKVRSSPSGNTQSSPKSKQ 

EVMVRPPTVMSPSGNPQLDSKFSNQGKQGGS 

ASQSQPSPCDSKSGGHTPKALPGPGGSMGLK 

NGAGNGAKGKGKRERSISADSFDQRDPGTPN 

DDSDIKECNSADHIKSQDSQHTPHSMTPSNAT 

APRS STPPHGQTTATEPTP AQKTPAKV VYVF S 

TEMANKAAEAVLKGQVETIVSFHIQNISNNK 

TERSTAPLNTQI S ALRNDPKPLPQQPP AP AN Q 

DQN S SQNTRLQPTPPIP AP APKP AAPPRPL DRE 

SPGVENKLIPSVGSPASSTPLPPDGTGPNSTPN 

NRAVTPVSQGSNSSSADPKAPPPPPVSSGEPPT 

LGENPDGLSQEQLEHRERSLQTLRDIQRMLFP 

DEKEFTGAQSGGPQQNPGVLDGPQKKPEGPI 

QAMMAQSQSLGKGPGPRTDVGAPFGPQGHR 

DVPFSPDEMVPPSMNSQSGTIGPDHLDHMTP 

EQIAWT .KLQQEFYEEKRRKPEQ WVQQCSLQ 

DMMVHQHGPRGVVRGPPPPYQMTPSEGWAP 

GGTEPFSDGINMPHSLPPRGMAPHPNMPGSQ 

MRLPGFAGMINSEMEGPNVPNPASRPGLSGV 

SWPDDVPKIPDGRNFPPGQGIFSGPGRGERFP 

NPQGLSEEMFQQQLAEKQLGLPPGMAMEGIR 

PSMEMNRAilPGSQRHMEPGNNPlFPRIPVEGP 

LSPSRGDFPKGIPPQMGPGRELEFGMVPSGM 

KGDVNLNVNMGSNSQMIPQKMREAGAGPEE 

MLKLRPGG SDMLPAQQKM VPLPFGEHPQQE 

YGMGPRPFLPMSQGPGSNSGLRNLREPI GPDQ 

RTNSRLSHMPPLPLNPSSNPTSLNTAPPVQRG 

LGRKPLDISVAGSQVHSPGINPLKSPTMHQVQ 

SPMLGSPSGNLKSPQTPSQLAGMLAGPAAAA 

SIKSPPVLGSAAASPVHLKSPSLPAPSPGWTSS 

PEPPLQSPGIPPNHKAPLTMASPAMLGNVESG 

GPPPPTASQPASVNIPG\SLPSSTPYTMPPEPTL 

SQNPLSIM\MSR\MSKFAM\PS\SNPGYNHDAI 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteme, 
D=Aspartic Acid, E=GIutamic Acid, 
^Phenylalanine, G=Glycinc, H=Histidinc, 
I=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, ^possible 
nucleotide insertion 














KTVASSDDDSPPARSPNLPSMNNMPGMGINT 
QNPR1SGPNPVVPMPTLSPMGMTQPLSHSNQ 
MPSPNAVGPNIPPHGVPMGPGLMSHNPIMGH 
GoQEPPM VPQOI<JVlvjr rQvjrrl Vv^bl I kjkivx rl 
HNGPS GG QG SFPG GMGFPGEGPLGRPSNLPQ 
SSADAALCKPGGPGGPDSFTVLGNSMPSVFT 
DPDLQEV1RPGATGIPEFDLSRIIPSEKPSQTLQ 
YFPRGEVPGRKQPQGPGPGFSHMQGMMGEQ 
ADDi/n at dp \ ,ir r.pnp\/rfTPF4TPi htap^x/p 

AJ J l\JV10j^AJ^roMOLTrvjr V vj I r UirL,\j i/vr oJVLr 

GHNPMRPPAFLQQGMMGPHIIRMMSPAQST 
MPGQPTLMSNPAAAVGMIPGKDRGPAGLYT 
HPGPVGSPGMMMSMQGMMGPXNRTS 


S29 


2179 


A 


6797 


433 


3 


ASFFNFSICICKlfLEVGPPVGHPAHDDVGGRH 

GPGGRyGSRSPRSLQCAPGGGRRSGCPAGSSP 

ASTCPPSPGGSGADRFGPSPPPPSREAAPTAG 

AAASSTSSGASCPPVPASSRWGVRSRTRSGSG 

GEREPRDRPSERPRLV 


S30 


2180 


A 


6800 


3 


1911 


LPEl^FGPRTPRAPRjmRRRLLLSPPPRPPPPL 

DREPRAPGPWLCPSRAGTAQDPARIRERRGR 

VAGGAAGPAMELRARGWWLLCAAAALVAC 

ARGDPASKSRSCGEVRQIYGAKGFSSS\DVPQ 

AEISGEHLRJCPQGYTCCTSEMEENLANRSHA 

ELETALRDSSRVLQAMLATQLRSFDDHFQHL 

LND S ERTLQ ATFPG AFGEL YTQN ARAFRDL Y 

SELRLYYRGANLHLEETLAEFWARLLERLFK 

QLHPQLLLPDDYLDCLGKQAEALRPF\GEAP\ 

RELRLRAT\RA\FVAAR\SFVQGLGVAS\DVVR 

KVAQVPLG\PEC\SRAV1EAGSYC/ALHCVGVP 

GARPCPDYCRNVLKGCLANQADLDAEWRNL 

LDSMVLITDKF WGTSCjVba VloIS V HI WJL/vt,A 

INALQDNRDTLTAKVIQGCGNPKVNPQGPGP 

EEKRRRGKLAPRERPPSGTLEKLVSEAKAQL 

RDVQDFWISLPGTLCSEKMALSTASDDRCWN 

GMARGRYLPEVMGDGLANQINNPEVEVDIT 

KPDMTIRQQIMQLKIMTNRLRSAYNGNDVDF 

QDASDDGSGSGSGDGCLDDLCGRKVSRKSSS 

SRTPLTHALPGLSEQEGQKTSAASCPQPPTFL 

LPLLLFLALTVARPRWR 


831 


2181 


A 


6808 


2 


1522 


ASRHGMTPGALLMLLGALGPPLAPGVRGSEA 

EGRU^KLFSGYDSSVRPAREVGDRVRVSVG 

LILAQLISLNEKDEEMSTKVYLDLEWTDYRLS 

WDPAEHDGIDSLRITAESVWLPDVVLLNNND 

GNFDVALDISVVVSSDGSVRWQPPGIYRSSCS 

IQVTYFPFDWQNCTMVFSSYSYDSSEVSLQT 

GLGPDGQGHQEIHIHEGTFIENGQWNIHKPS 

IU J IQPPGDPKGuRliGQKi H )c.vir YJLiiiusJvri^r i 

LVNVIAPCELITLLAIFVFYLPPDAGEKMGLSIF 

ALLTLTVFLLLLADKVPETSLSVP111KYLMFT 

N4VLVTFSVILSVWLNLHHRSPHTHQMPLWV 

RQIFIHKLPLYLRLKRPKPERDLMPEPPHCSSP 

GSGWGRGTDEYFIRKPPSDFLFPKPNRFQPEL 

SAPDLRRFIDGPNRAVALLPELREWSSISYIA 

RQLQEQEDHDALKEDWQFVAK1VVDRLFLW 

TFIIFTSVGTLW1FLDATYHLPPPDPFP 


832 


2182 


A 


6824 


71 


1079 


ETMAKNPPENCEDCHILNAEAFKSKICICKSLK 
ICGLVFGILALTLIVLFWGSKHFWPEVPKKAY 
DMEHTFYSNGEKKKIYMEIDPVTRTEIFRSGN 
GTDETLEVHDFKNGYTGIYFVGLQKCFIKTQI 
KV1PE1 SEPEEEIDENEE1TTTFFEQSVIWVPAE 
KPIENRDFLKNSKILEICDNVTMYW\INPTL\1S 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


iMet 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, 

D=A^nartie Acid P=nintnmic ApiH 

F=Phenylaianine. G=Glycine, F^Histidine, 
Wsoleucinc. K=Lysinc, L=Leucinc, 
M*=Methionine, N=Asparagine, P=Proline. 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V— Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














GTFAKQLHHNFAFIILVSELQDFEEEGEDLHFP 

ANEKKGIEQNEQWWPQVKVEKTRHARQAS 

EEELPrNDYTENGlEFDPMLDERGYCCIYCRR 

GNRYCRRVCEPLLGYYPYPYCYQGGRVICRV 

IMPCNWWVARMLGRV 


833 


2183 


A 


6846 


116 


602 


EAEGEQVCGAKCCGDAPHVENREEETARJGP 

GVMESKEERAJLNNLrVENVNQENDEKDEKE 

QVANICGEPLALPLNVSEYCVPRGNRRRFRVR 

QPILQYRWDIMHRLGEPQARMREENMERJGE 

E VRQLMEKLREKQLS H S LRA V STDPPHHDHI I 

DEFCVLMP 


S34 


2184 


A 


685 1 


3 


2024 


PNGVALLHI.PGAAVIPNTNYMFQDALGGRSR 

GSREESPAPSRAPASASLWRRLVVVEAKMAA 

HAAAAAQAAAAQAAHAE.AADSWYLALLGF 

AEHFRTSSPPKIRLCVHCLQAVFPFKPPQRIEA 

RTHLQLG SVLYHHTKN SEQ ARSHLEKA WLI S 

QQIPQFEDVKPEAASLLSELYCQENSVDAAKP 

LLRKAIQISQQTPYWHCRLLFQLAQLHTLEKD 

LVSACDLLGVGAEYARVVGSEYTRALFLLSK 

GMLLLMERKLQEVHPLLTLCGQIVENWQGN 

PIQKESLRVFFLVLQVTHYLDAGQVKSVKPC 

LKQLQQCIQTISTLHDDEILPSNPADLFHWLP 

KEHMCVLVYLVTVMHSMQAGYLEKAQKYT 

DKALMQLEKLKMLDCSPILSSFQV1LLEHITM 

HAAQLHTLLGLYCVSVNCMDNAEAQFTTAL 

RLTNHQELWAFIVTNLASVYIREGNRHQEVVN 

LYSLLERINPDHSFPVSSHCLRAAAFYVRGLF 

SFFQGRYNEAKRFLRETLKMSNAEDLNRLTA 

CSLVLLGHIFYVLGNHRESNNMVVPAMQLAS 

KIPDMSVQLWSSALLRDLNKACGNAMDAHE 

AAQMHQNFSQQLLQDHIEACSLPEHNLITWT 

DGPPPVQFQAQNGPNTSLASLL 


835 


2185 


A 


6855 


334 


1268 


PTRRPILPLTSPKAISVPSPLQGKQHTLVKSCL 
SVSG1GOFI V^l ^RMfcTI OTT AVWTAT rp\u<I 

ayvpcqtqdrdalrltleqidltrrmcasyse 

lelvtsakalndtqklacligvegghsldns 

lsilrtfymlgvryltlthtcntpwaessak 

G VHSF YNNI S GLTDFGEK V V AEMNRLGMMV 

DLSHVSDAVARRALEVSQAPVIFSHSAARGV 

CNSARNVPDDILQLLEEERWAFVMVSLFHGE 

LIQWQPIRPMCSTVADHFDHIKAWGSKFTGI 

GGDYDGAGKYRKKTTCKAPWRTSSRMSS 


836 


2186 


A 


6862 


315 


n 


PPRSRPSCWRKKVGPGRPWWWGGTGPPGQG 
RPEIRLLPLPMTGACGAVAASRTGSSGPG/SSL 
PNGHGGKGSGLANGLAGNPXGHLGLGSSFGT 
GPGSGRPPP 


837 


2187 


A 


6863 


2 


1615 


VLRGQRGPAGGLAEERRRGRNEWRIHDVTT 

APFPGLVQRRSRLL1VSQVRYFLKNKVSPDLC 

NEDGLTALHQCCIDNFEEIVKLLLSHGANVN 

AKDNELWTPLHAAATCGHINLVKILVQYGA 

DLLAVNSDGNK4PYDLCEDEPTLDVIETCMAY 

QGITQEKINEMRVAPEQQMIADIHCMIAAGQ 

DLDWIDAQGATLLHIAGANGYLRAAELLLDH 

GVRVDVKDWDGWEPLHAAAFWGQMQMAE 

LLVSHGAN\LNARTSMDEMPIDLCEEEEFKVL 

LEELK\HKHDVIMKSQLRHKSSLSRRTSHRQA 

S/SVGKVVRRTQPVGTGPNUYRKEYE/GEEAI 

LWQRSAXAEDQRTSTYNGDIRETARTDQENKD 
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SEQ ID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 3 
SIO: of 
Deplide 
seq- 
uence 


Viet 
ftod 


SEQ 1 
ID NO: 
m 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cystcinc, 1 
D=Aspartic Acid, E-Glutamic Acid, 
F=Phenylalanine, G=GJycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=I^ucine, 
M=Methionine, N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine ; V=Valinc, W^Tryptophan, 
Y=Tyrosine, X=Unknown. *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














PNPRLEK\PVLLSEFPTKIPRGELDMPVENGLR 

APVSAYQYALANGDVWKVHEVPDYSMAYG 

NPGVADATPPWSSYKEQSPQTLLELKRQRAA 

AKLLSHPFLSTHLGSSMARTGESSSEGKAPLI 

GGRTSPYSSNGTSVYYTVTSGDPPLLKFKAPI 

EEMEEKVHGCCRIS 


838 


2188 


A 


6865 


6291 


739 


AGPLEPRVQGAMALQLWALTLLGLLGAGAS 

LRPRKLDFFRSEKELNHLAVDEASGVVYLGA 

VNALYQLDAKLQLEQQVATGPVLDNKKCTP 

PIEASQCHEAEMTDN VNQLLL VDPPR KR1 .VE 

CGQLLKGIXCALRALSNISLRLFYEDGSGEKSF 

VASNDEGVATVGLVSSTGPGGDRVLFVGKG 

NGPHDNGIIVSTRLLDRTDSREAFEAYTDHAT 

YKAGYLSTNTQQFVAAFEDGPYVFFVFNQQD 

KHPARNRTLLARMCREDPNYYSYLEMDLQC 

RDPDIHAAAFGTCLAASVAAPGSGRVLYAVF 

SRDSRSSGGPGAGLCLFPLDEVHAKMEANRN 

ACYTGTREARD1FYKPFHGDIQCGGHAPGSSK 

SFPCGSEHLPYPLGSRDGLRGTAVLQRGGLN 

LTAVTVAAENNHTVAFLGTSDGRJLKVYLTP 

DGTS SEYDSIL VEIN KRVKRDLVLSGDLG SLY 

AMTQDKVFRLPVQECLSYPTCTQCRDSQDPY 

CGWCVVEGRCTRKAECPRAEEASHWLWSRS 

KSCVAVTSAQPQNMSRRAQGEVQLTVSPLPA 

LSEEDELLCLFGESPPHPARVEGEAVICNSPSS 

IPVTPPGQDHVAVT1QLLLRRGNIFLTSYQYPF 

YDCRQAMSLEENLPCISCVSNRWTCQWDLR 

YHECREASPNPEDGI VRAHMEDSCPQFLG PSP 

LVIPMNHETD VNFQGKNLDTVKG SSLHVGSD 

LLKFMEPVTMQESGTFAFRTPKJLSHDANETL 

PLHLYVKSYGKNIDSKLHVTLYDCSFGRSDC 

SLCRAANPDYRCAWCGGQSRCVYEALCNTT 

SECPPPVITRIQPETGPLGGGIRITILGSNLGVQ 

AGDIQRISVAGRNCSFQPERYSVSTRIVCVIEA 

AETPFTGGVEVDVFGKLGRSPPNVQFTFQQP 

KPLSVEPQQGPQAGGTTLTIHGTHLDTGSQED 

VRVTLNGVPCKVTKFGAQLQCVTGPQATRG 

QMLLEVSYGGSPVPNPGIFFTYRENPVLRAFE 

PLRSFASGGRSINVTGQGFSLIQRFAMW1AEP 

LQSWQPPREAESLQPMTVVGTDYVFHNDTK 

VVFLSPAVPEEPEAYNLTVLIEMDGHRALLRT 

EAG AFEY VPDPTFENFTGG VKKQVNKLIRAR 

GTNLNKAMTLQEAEAFVGAERCTMKTLTET 

DLYCEPPEVQPPPKRRQKRDTTHNLPEF1VKF 

GSREWVLGRVEYDTRVSDVPLSLILPLVIVPM 

WVIAVSVYCYWRKSQQAEREYEKIKSQLEG 

LEESVRDRCKKEFTDLMIEMEDQTNDVHEAG 

IPVLDYKTYTDRVFFLPSKDGDKDVM1TGKL 

DIPEPRRPWEQALYQFSNLLNSKSFLINFIHT 

L\ENQPEFSARAKVYFASLLTVALHGKLEYYT 

DIMFITLFLELLEQYVVAKNPKLMLRRSETVV 

FPTvn WWMSTCI YOYLKDSAGEPLYKLFKAI 

KHQVEKGPVDAVQKKAKYTLNDTGLLGDD 

VEYAPLTVSVIVQDEGVDAIPVKVLNCDT1SQ 

VKEKIIDQVYRGQPCSCWPRPDSVVLEWRPG 

STAQILSDLDLTSQREGRWKRVNTLMHYNVR 

DGATLILSKVGVSQQPEDSQQDLPGERHALL 

EEENRVWHLVRPTDEVDEGKSKRGSVKEKE 

RTKAITEIY. .TRLLSVKGTLQQFVDNFFQSVL 

APGHAVPP A VKYFFDFLDEQ AEKHNI QDEDTI 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

r*)=A<snfirtir Arid P^frliitnmir AriH 

F=Phenylalanine, G=Glycine, H^Histidine, 
J=Iso]eucine, K=Lysinc. L=Leucinc, 
M=Methionine, N=Asparagine, P=Proline 5 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine : V=Valine, W=Tryptophan, 
Y— Tyrosine, X = Unknown, * =5 Stop codon, 
/^possible nucleotide deletion, possible 
nucleotide insertion 














HIWKTNSLPLRFWVNILKNPHFIFDVHVHEVV 
D A SLS VI AQTFMD ACTRTEHKLSRDSPSNKJLL 
YAKEISTYKJCMVED Y YKG1RQMVQ VSDQDM 
NTHLAEISRAHTDSLNTLVALHQLYQYTQKY 
YDEIINALEEDPAAQIOvlQLAFRLQQIAAALE 
NKVTDL 


839 


2189 


A 


6872 


1 


1485 


RARRLALQCHVCVCALTPGEQSGRRLPGQT 
WLMFSCFCFSLQDNSFSSTTVTECDEDPVSLH 
EDQTDC S SLRDENNKENYPD AG AL VEEH APp 
S WEPQQQN VE AT VL VDS VLRPSMGNFKSRKP 
KSIFKAESGRSHGESQETWWSSQSECQVRA 
GTPAHESPQNNAFKCQET\VRL\QPRJDQRTAT 
SPKDAFETR\QDLNEEEAAQVHGVKDPAPAS 
TOSVLAVDOTDSADP^PVHIv HfiONF A H<? APP 

DLHSVGTSRLLL/YHITDGDNPTAVRHGCSL/F 
SGQSQRFNLDPESAPSPPSTQQFMMPRSSSRC 
SCGDGK£PQTITQLTKJHIQSLKRKIRKFEEKFE 
QEKKYRPSHGDKTSNPEVLKWMNDLAKGRK 
QLKELKLKLSEEQGSAPKGPPRKLLCEQPTVP 
RENGKPEAAGPEPSSSGEETPDAALTCLKERR 
EQLPPQEDSKVTKQDKNLIKPLYDRYRIIKQIL 
STPSLIPTIVSQDTCMLLLCTDV 


840 


2190 


A 


6873 


2 


2054 


FFRFYFSF1RLFAMSLADLTKTNIDEHFFGVAL 

E>TNRRSAACKRSPGTGDFSRNSNASNKSVDY 

SRSQCSCGSLSSQYDYSEDFLCDCSEKAiNRN 

YLKQPVVKE1CEKKKYNVS1CISQSKGQKEISV 

EKKHTWNASLFNSQIHMIAQRRDAMAHRILS 

ARLHKIKGLKNELADMHHKLEAJLTENQFLK 

QLQLRHLKA1GK YENSQNNLPQIMAKHQNE V 

KNLRQLLRK.SQEKERTLSRKLRETDSQLLKT 

KDILQALQKLSEDKNLAEREELTHKLSIITTK 

MDANDKK1QSLEKQLRJLNCRAFSRQLAIETR 

KTLAAQTATKTLQVEVKHLQQICLKEICDREL 

EIKNIYSHRJLKNLHDTEDYPKVSSTKSVQAD 

RKILPFTSMRHQGTQKSDVPPL/TTKGKKATG 

NFDHKFKSTFrNIl-rPIPl-ir'VlNlKT PkT>Fr>^krrc*r V 
iNii^»xjj\_i->i\.o I Hill n jrxc»ij riv v jNryjui fv^X_/i^olVi\--rv i 

EDLSGEEKHLEVQn.LENTGRQKDKKEDQEK 

KNIFVKEEQELPPKJ1EVIHPERJESNQEDVLVR 

EICFKRSMQRNGVDD1ALGKGTAPYTKGPLRQ 

RRHYSFTEATENLHHGLPASGGPANAGNMR 

YSHSTGKHLSNREEMELEHSXDSGYEPSFGKS 

SRIKVKDTTFRDKKSSLMEELFGSGYVLKTD 

QSSPGVAKGSEEPLQSKESHPLPPSQASTSHA 

FGDSKVTVVNSIKPSSPTEGRRiaiL 


841 


2191 


A 


6874 


3 


2867 


SSRTREMEEKEILRRQIRLLQGLIDDYKTLHG 

NAPAPGTPAASGWQPPTYHSGRAFSARYPRP 

SRRGYSSHHGPSWRKKYSLVNRPPGPSDPPA 

DHAVRPLHGARGGQPPVPQQHVLERQVQLS 

QGQNWIKVKPPSKSGSASASGAQRGSLEEFE 

DTPWSDQRPREGEGEPPRGQLQPSRPTRARG 

TCSVEDPLLVCQKEPGKPRMVKSVGSVGDSP 

REPRRTVSESVIAVKASFPSSALPPRTGVALG 

RKLGSHSVASCAPQLLGDRRVDAGHTDQPVP 

SGSVGGPARPASGPRQAREASLVVTCRTNKF 

RKiWYKWVAASSKSPRVARRALSPRVAAEN 

VCKASAGMANKVEKPQLIADPEPKPRKPATS 

SKPGSAPSKYKWKASSPSASSSSSFRWQSEAG 

SKDHASQLSPVLSRSPSGDXRPALAHSGLKPLS 

GETPLSAYKVKTRTKJIRRRGSTSLPGDKKSG 

TSPAATAK SHLSI .RRRQALRGKSSP VLKKTPN 



260 



3DOCID: <WO 0157188A2J_> 



WO 01/57188 



PCT/US01/03800 



SEO ID ! 
NO: of 1 
nucl- ] 
eotide 
seq- 
uence 


3EQ ID 1 
SO: of 
Deptide 
seq- 
uence 


Viet 
hod 


SEQ 1 
[D NO: 1 
n i 
LJSSN 
09/496 
914 


Predicted 3 
beginning i 
mcleotide 
ocation 
correspond! 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
lucleotide 1 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


\rnino acid sequence (A=Alanme C=Cysteine 5 
D=Aspartic Acid, E^Glutamic Acid, 
F=Phenylalanine, G=Glycine } H=Histidine, 
[=Isoleucine, K=l ,ysine ; L=Leucine, 
M=Methioriine ? N=Asparagine, P=Proline : 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














KGLVQVTKHRLCRLPPSRAHLPTKEASSLHA 

VRTAPTSKVHCTRYRIVKKTPASPLSAPPFPLS 

LPSWRARRLSLSRSLVLNRLRPVASGGGKAQ 

PGSPWWRSKGYRCIGGVLYKVSANKLSKTSG 

QPSDAGSRPLLRTGRLDPAGSCSRSLASRAVQ 

RSLAIIRQARQI^REKRJCEYCMYYNRFGRCNR 

GERCPYIHDPEKV AVCTRFVR01 CKJ^ l uu l L, 

PFSHHVSKEKMPVCSYFLKGICSNSNCPYSHV 

YVSRKAEVCSDFLKGYCPLGAKCKKKHTLLC 

PDFARRGACPRGAQCQLLHRTQIOU-ISRRAAT 

SPAPGPSDATARSRVSASHGPRICPSASQRPTR 

QTPSSAALTAAAVAAPPHCPGGSASPSSSKAS 

SSSSSSSSPPASLDHE\APSLQEAALAAACSNR 

LCKLPSFISLQSSPSPGAQPRVRAPRAPLTKDS 

GKPLHIKPRL 


842 


2192 


A 


6898 


506 


2071 


WPDLVHTWSSEEAMGSCCSCPDKDTVPDNH 

RNKFKVINVDDDGNELGSGIMELTDTELILYT 

RKRDSVKWHYLCLRRYGYDSNLFSFESGRRC 

QTGQGIFAFKCARAEELFNMLQEIMQNNSIN 

WEEPVVERNNHQTELEWRTPRTPTTPGFAA 

QNLPNGYPRYPSFGDASSHPSSRHPSVGSARL 

PSVGEESTHPLLVAEEQYHTYVNTTGVQEER 

KNRTSVHVPLEARVSNAES STPKEEPSSIEDR 

DPQ1LLEPEGVKFVLGPTPVQKQLMEKEKLE 

QLGRDQVSGSGANNTEWDTGYDSDERRDAP 

SVNKLVYENINGLSIPSASGVRRGRLTSTSTSD 

TQN1NNSAQRRTALLNYENLPSLPPVWEARK 

LSRDEDDNLGPKTPSLNGYHNNLDPMHNYV 

NTEN VTVP AS AHKIEYSRRRDCTPT VFNFDIR 

RPSLEHRQLNYIQVDLEGGSDSDNPQTPKTPT 

TPLPQTPTRRTEL Y A VIDLERTAAM L^kAL 

PRDDGTSR\KTRHNST\DLPL 


843 


2193 


A 


6919 


2 


663 


AGRPGTTFLASGKMAYQSLRLEYLQIPP V SRA 

YTTACVLTTAAVQLELITPFQLYFNPELIFKHF 

QIWRLITNFLFFGPVGFNFLFNMIFLYRYCRM 

LEEGSFRGRTADFWMFLFGGFLMlJLrULr vb 

LA^LGPGLYNN/GSSMCGAEXEPLCPHELLRP 

SQLPGPLSALGAHGIFLWGELNHCGPFGYCS 

WTHIFFLGRCISQSTWWNKNSENTIYFESYF 


844 


2194 


A 


6928 


902 


366 


HRLCMP1QGACGERME/FSLLLPGLECNUV1L 
AHCNLRLPGSSNSPASASQVAGITGVCHHAR 
LIFVFSVETGFLHAGQAGLELLTSGDPPASAS 
QSAGITGKSQI ITRPGYEFIIPYSAAQEDALKA 
LM 


845 


2195 


A 


6939 


1660 

- 


317 


L YPENLGE SLFPILLLPPP WPDG GRPCC VEMS 

TRAKKLRRIWR1LEEKESVAGAVQTLLLRSQE 

GGV\TSAAASTLSEPPRRTQESRTRTRALGLPT 

LPMEKLAASTEPQGPRPVLGRESVQVPDDQD 

FRSFRSECEAEVGWNLTYSRAGVSVWVQAV 

EMDRTLHKIKCRMECCDVPAETLYDVLHD1E 

YRJCKWDSNVIETFD1 ARLTVNADVGY'YS WR 

CPKPLKNRDVITLRSWLPMGADYIIMNYSVK 

HPKYPPRKDLVRAVSIQTGYLIQSTGPKSCVIT 

YLAQVDPKGSLPKWV\^SSQFLAPKAMKK 

MYKACLKYPEWKQKHLXPHFKPWLVHPEQSP 

LPSLALS\ELSVQHADS\LENIDESAV\AESREE 

RVMGGAGGEGVSDDDTSLYAEAPHRFRETETG 

PGAGRALGAAAAPALSPLHPPGTWWHRARP 

RRVLQPGWTEPQ 
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SEQ TD 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 

\j — /Vbpixrui' /acjli, el — vjf midline /aciu, 

F=Phcnyl alanine, G=Glycine, H=Histidine, 
I-I so leucine, K^Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Ghitamine, R=Arginine ? S=Serine, 
T=Threonine ; V= Valine, W=Tryptophan, 
Y=Ty rosin e, X^Unknown, *=Stop codon, 
/^possible nucleotide deletion. V=possible 
nucleotide insertion 


846 


2196 


A 


6944 


42 


2672 


RRKMAGCRGSLCCCCRWCCCCGERETRTPE 

ELTILGETQEEEDE1LPRKDYESLDYDRCINDP 

YLEVLETMDNKKGRRYEAVKWMVVFAIGV 

CTGLVGLFVDFFVRLFTQLKFGWQTSVEECS 

QKGCLALSLLELLGFNLTFVFLESLLGLIEPVE 

AGSGITEGKCYLYARQVPGLVRJ.PTLLWKAL 

GVLLTVAAMLL1\GLGSPMIHSGSVVGAGLPQ 

FQSISLRKIQFNFPYFRSDRYGKADKRDFVSAG 

AAAGVAAAFGAPIGGTLFSLEEGSSFWNQGL 

TWKVLFCSMSATFTLNFFRSGIQFGSWGSFQL 

PGLLNFGEFKCSDSDKKCHLWTAMDLGFFV 

VMGVIGGLLGATFNCLNKRLAKYRMRNVHP 

KPKLVRVLESLLVSLVTTVVVFVASMVLGEC 

RQMSSSSQIGNDSFQLQVTEDVNSSIKTFFCP 

NDTYNDMATLFFNPQESAILQLFHQDGTFSPV 

TLALFFVLYFLLACWTYGISVPSGLFVPSLLC 

GAAFGRLVANVLKSYIGLGHIYSGTFAUGAA 

AFLGGWRMTISLTVILIESTXNEITYGLPIMVT 

LMVGKWTGDFFNKGIXYDIHVGLRGVPLLEW 

SELRTTVHHAFPVVTENRGNEKEFMKGNQL1S 

NNIKFKKSSILTRAGEQRKRSQSMKSYPSSEL 

RNMCDEHIASEEPAEKEDLLQQMLERRYTPY 

PNLYPDQSPSEDWTMEERFRPLTFHGULRSQ 

LVTLLVRGVCYSESQSSASQPKJLSYAEMAED 

YPRYPDIHDLDLTLLNPRMIVDVTPYMNPSPF 

TVSPNTHVSQVFNLFRTMGLRHLPVVNAVGE 

I VG JITRHNLTYEFLQARLRQH YQTI 


847 


2197 


A 


6951 


3 


1994 


NTNSSSVTNSAAGVEDLNIVQVTVPDNEKER 

LSSIEKIKQLREQVNDLFSRICFGEAIGVDFPVK 

VPYRKITFNPGCVVIDGMPPGVVFKAPGYLEi 

SSMRRILEAAEFIKFTVIRPLPGLELSNGEYST 

VGICRKIDQEGRVFQEKWERAYFFVEVQNIST 

CLICKRSMSVS[<£YNLRRHYQTNHSKHYDQY 

MERMRDEKXHELKKGLRKYLLGLSDTECPE 

QKQVFANPSPTQKSPVQPVEDLAGNLWEKLR 

EKIRSFVAYSIAIDE1TDINNTFQLA1FIRGVDE 

NFDVSEELLDTVPMTGTKSGNEIFSRVEKSLK 

NFCINWSKLVSVASTGTPPMVDANNGLVTKX 

KSRVATFCKGAELKSTCCnHPESLCAQ\I<JLKK4 

r^HVA/frYV/VVT^^VTOWTr^Pril kft-? c it uttt r VPT 
un. v lvli_j v \ v fvo v In w iv^olvOLINriot»r I ILL Y liL 

DSQYGSLLYYTE1KWLSRGLVLKRFFESLEEI 

DSFMSSRGKPLPQLSSIDWIRDLAFLVDMTM 

HI NAT NTS! OOH^OTVTOMVTM TP APT AVI r*T 

WETHLTRNNLAHFPTLKLVSRNESDGLNYIP 

KIAELKTEFQKRLSDFKLYESELTLFSSPFSTKI 

DSVHEELQMEVIDLQCNTVLKTKYDKVGTPE 

FYKYLWGSYPKYKHHCAKILSMFGSTYICEQ 

LFSIMKLSKTKYCSQLKDSQWDSVLHIAT 


848 


2198 


A 


6985 


3 


289 


SVQYLPGRPTRTHASTDAPLMLKFTPLPSKTK 
ASAPVQCLLLMAATFSPQGLAKPHSGTIPITXC 
CFNAINTKIPIQRLESYTRITNIQCPKEAVM 


849 


2199 


A 


6999 


963 


5 


LDFLCHRDMGDNITSITEFLLLGFPVGPRIQM 

LLFGLFSLFYVFTLLGNGTILGLISLDSRJLHAP 

M\TFLSHL\AVVD1AYACNTVPRMLVNLLHP 

AKPISFAGRMMQTFLFSTFAWECLLLVVMS 

YDLYVXAICHPLRYLAJMTWRVCITLAVTSWT 

TGVLLSLIHLVLLLPLPFCRPQKIYHFFCEILA 

VLKLACADTHINENMVLAGA1SGLVGPLSTIV 

VSYMCILCAILQIQSREVQRKAFCTCFSHLCVI 
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SEQ ID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
MO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

L)=Aspartic ACia, b,— oiuiami^ /\^iu, 

F=Phenylalaninc, G=Glycine, H=Histidine, 

t TcnUnninp T£=l vcinp 1 — T .fiicine 

i — isoieucmc, rv i^yoiii^^ *-> iaiuvmiv, 

M=Methionine, N=Asparagine, P=Proline, 

Q=Glutamine, R=Arginine, S=Serine, 

T=Threonine, V=V aline, W-Tryptophan, 

Y=Tyrosine, X=Unknown 5 *=*Stop codon, 

/=possiblc nucleotide deletion, \=possible 

nucleotide insertion 














GLFYGTAIIMY VGPRYGNPKEQKKYLLLFH S 
LFNPMLNPLICSLRN S E VKNTLKR VLG V ERAL 


850 


2200 


A 


7001 


1 


1011 


MGNDSVSYEYGDYSDLSDRPVDCLDGACLAI 
DPLRV APLPL YAAIFLVG VPGN AM VA WV AG 

PIARGGHWPYGAVGCRALPSIILLTMYASVLL 
LAALSADLCFLALGPAWXCLRFS/GACGVQVA 
CG AA WTL ALLLT VP S AI YRRLHQEHFP ARLQ 
CVVDYGGSSSTENAVTAIRFLFGFLGPLVAVA 
SCHSALLCWAARRCRPLGTAIVVGFFVCWAP 
YHLLGLVLTVAAPN S ALLARALRAEPLT VGL 
ALAHSCLNPMLFLYFGRAQLRRSLPAACHW 
AJLRESQGQDESVDSKKSTSHDLVSEMEV 


851 


2201 


A 


7011 


1 


2310 


AAASPLRMSRKGPRAEVCADCSAPDPGWASI 

SRGVLVCDECCSVHRSLGRHISIVKHLRHSA 

WPPTLLQMVHTLASNGANSIWEHSLLDPAQV 

QSGPALKQTPKDKVXHPIKSEFIRAKYQMLAF 

WKLPCRDDDGVTAKDLSKQLHSSVRTGNLE 

TCLRLLSLGAQANFFHPEKGTTPLHVAAKAG 

QTLQAELLVVYGADPGSPDVNGRTP1DYARQ 

AGHHELAERLVECQYELTDRLAFYLCGRKPD 

HKNGHYIIPQMADSLDLSELAKAAKKKLQAL 

SNRLFEELAMDVYDEVDRRENDAVWLATQN 

HSTLVTERSAVPFLPVNPEYSATRNQGRQKL 

ARFNAREFATLI1D1LSEAKRRQQGKSLSSPTD 

NLELSLRSQSDLDDQHDYDSVASDEDTDQEP 

LRSTGATRSNRARSMDSSDLSDGAVTLQEYL 

ELKKALATSEAKVQQLMKVNSSLSDELRRLQ 

REIHKLQAENLQLRQPPGPVPTPPLPSERAEH 

TPMAPGGM HKKJJKvAr JM X truc>/\Ll\xruu 

PPGDEL1TRLQPFHSTELEDDAJYSVHVPAGL 

YRIRKGVSASAVPFTPSSPLLSCSQEGSRHTSK. 

LSRHGSGADSDYENTQSGDPLLGLEGKRFLE 

LGKEEDFHPELESLDGDLDPGLPSTEDY1LKT 

EQVTKNIQELLRAAQEFKHDSFVPCSEKIHLA 

VTEMASLFPKRPALEPVRSSLRLLNASAYRLQ 

SECRKTVPPEPGAPVDFQLLTQQV1QCAYD1A 

KAAKQLVTITTREKKQ 


852 


2202 


A 


7016 


484 


1777 


" RISKIQVYYSTGYSSRKIvlNPTLGLAIFLAVLL 
TVKGLLKPSFSPRNYKALSEVQGWKQRMAA 
KELARQNMDLGFKLLKKI . AFYNPGRNIFLSP 
LSISTAFSMLCLGAQDSTLDEIKQGFNFRKMP 
EKDLHEGFHYIIHELTQKTQDLKLSIGNTLFID 
QRJLQFQl>J^i^t^J^AJvrNr X o/aj-zI ili iNrv^iNi^civi 
AQKQINDFI/ESKTHGKINNLIENIDPGTVMLL 
ANYIFFRARWKHEFDPNVTKEEDFFLEKNSS 
VKVPMMFRSGIYQVGYDDKXSCTILEIPYQK 
NITAIFILPDEGKEKHLEKGLQVDTFSRWKTL 
LSRRWDVSVPRLHMTGTFDLKKTLSYIGVS 
KIFEEHGDLTKIAPHRSLKVGEAVNKAELKM 
DERGTEGAAGTGAQTLPMETPLVVKIDKPYL 
LLIYSEKIPSVLFLGK1VNPIGK 


853 


2203 


A 


7017 


1 


3293 


" "MTHACNPSTLGGQGRRITRSHGRIUISSRGPV 
ARHVAAGAGHENKHGGSRRFPAGVAPRRAM 
ANVSKKVSWSGRDRDDEEAAPLLRRTARPG 
GGTPLLNGAGPGAARQSPRSALFRVGHMSSV 
ELDDELLEPXDMDPPHPFPKEIPHNEKLLSLKY 
ESLDYDNSENQLFLEEERRINHTAFRTVEIKR 
WV1CALIGILTGLVACF1 31VVENLAGLKYRVI 
KGSILPNIDKFTEKGGLSFSLLLWATLNAAFV 
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>ISDOCID <WO 0157188A2_L> 



WO 01/57188 




T/US0 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 

P>=A^n?irtic Acid P=T"ilntfimir ApiH 

F=Phenylalanine 5 G=-Glycine, H=Histidine, 
f=IsoIcucinc, K=Lysine } L=Leucinc, 
M=Mcthionine, N^Asparagine, P^Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine a V=Valine, W^Tryptophan, 
Y=Tyrosine, X=Unk:nown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














LVGSVIVAFIEPVAAGSGIPQIKCFLNGVKIPH 

VVRI XTLVIKVSGVILSV VGGLAVGKEGPMI 

HSGSVIAAGISQGRSTSLKRDFK1FEYFRRDTE 

KRDFVSAGAAAGVSAAFGAPVGGVLFSLEEG 

ASF WNQFLTWRIFFASMISTFTLNF VLSI YHG 

NM WDLS SPGLINFGRFDSEKMAYTIHEIPVFI 

AMGVVGGVLGAVFNALNYWLTMFRIRYJHR 

PCLQVIEAVLVAAVTATVAFVLIYSSRDCQPL 

QGGSMSYPLQLFCADGEYNSMAAAFFNTPEK 

SVVSLFHDPPGSYNPLTLGLFTLVYFFLACWT 

YGLTVSAGVFIPSLLIGAAWGRLFGISLSYLTG 

AAIWADPGKYAXMGAAAQLGGIVRMTLSLT 

VIMMEATSNVTYGFPIMLVLMTAKIVGDVFIE 

GLYDMHIQLQSVPFLHWEAPVTSHSLTAREV 

MSTPVTCLRRREKVGVIVDVLSDTASNHNGF 

PVVEHADDTQPARLQGLELRSQLIVLLKHKVF 

VERSNLG L VQRRLRLKDFRD A YPRFPPIQSIH 

VSQDERECTMDLSEFMNPSPYTVPQEASLPR 

VFKLFRALGLRHLVVVDNRNQWGLVTRKD 

AGAAQQPCQLRAVTLEDLGLLLAGGLASPEP 

LSLEELSERYESSFIPTSTASVPEQDTAKHWNQ 

LEQWVVELQAEVACLREHKQRCERATRSLL 

RJELLQVRARVQLQGSELRQLQQEARPAAQAP 

EKEAPEFSGLQNQMQALDKRLVEVREALTRL 

RRRQVQQEAERRGAEQEAGLRLAKLTDLLQ 

QEEQGRE VACGALQfCNQEDSSRRVDLEVAR 

M 


854 


2204 


A 


7037 


139 


2604 


AGTWEPRPYDQAKETGAPGSQPPVPPMELRP 

WLLWVVAATGTLVLLAADAQGQKVFTNTW 

AVRIPGGPAVANSVARKHGFLNLGQIFGDYY 

HFWHRGVTKRSLSPHRPRHSRLQREPQVQWL 

EQQVAKRRTKRDVYQEPTDPKFPQQWYL\SG 

VTQVRDLMVKAAWAQGYTGHGIVVSILDDGI 

EKNHPDLAGNYDPGASFDVNDQDPDPQPRY 

TQMNDNRHGTRCAGEVAAVANNGVCGVGV 

AYNARIGGVRMLDGEVTDAVEARSLGLNPN 

HIHIYSASWGPEDDGKTVDGPARLAEEAFFR 

GVSQGRGGLGSIFVWASGNGGREHDSCNCD 

GYTNSIYTLSISSATQFGNVPWYSEACSSTLA 

TTYSSGNQNEKQIVTTDLRQKCTESHTGTSAS 

APL AAGI I ALTLEANKNLT WRDMQHL V VQTS 

KPAHLN ANDWATNG VGRKVSHS YG YGLI.D 

AGAMVALAQNWTTVAPQRKCIIDILTEPKD1 

GKRLEVRKTVTACLGEPNHITRLEHAQARLT ! 

LSYNRRGDLAIHLVSPMGTRSTLLAARPHDY 

SADGFNDWAFMTTHSWnFnP^OFWVT FTF"NI 

TSEANNYGTLTKFTLVLYGTAPEGLPVPPESS 

GCKTLTSSQACVVCEEGFSLHQKSCVQHCPP 

GFAPQVLDTHYSTENDVETIRASVCAPCHAS 

CATCQGPALTDCLSCPSHASLDPVEQTCSRQS 

QSSRESPPQQQPPRLPPEVEAGQRLRAGLLPS 

HLPEVVAGLSCAFIVLVFVTVFLVLQLRSGFS 

FRG VK VYTMDRGLI S YKGLPPEA WQEECPSD 

SEEDEGRGERTAFIKDQSAL 


855 


2205 


A 


7058 


3 


1441 


QRPASQLLAPFAAEAJLPGAPRAAMAQHFSLA 

ACDWGFDLDHTLCRYNLPESAPLIYNSFAQF 

LVKEKGYDKELLNVTPEDWDFCCKGLALDL 

EDGNFLKLANNGTVLRASHGTKMMTPEVLA 

EAYGKK£WKHFI,SDTGMACRSGKYYFYDN 
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5 DOC ID: <WO 0157188A2_L> 



WO 01/57188 



PCT/U SO 1/03800 



SEQ ID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 1 
HO: of 
peptide 
seq- 
uence 


Viet ! 
nod 


SEQ 1 
ID NO: 1 
n i 
USSN 
09/496 
914 


Predicted 1 
beginning i 
nucleotide 
ocation 
correspond! 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
lucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C~Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
^Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline 5 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














YFDLPGALLCARVVD YLTKLNNGQKTFD1* W 

ICDIVAAIQHNYFCMSAH<ENCGIYFPEIKRDPG 

RYLHSRPES VKJvWLRQLK.NAUiSai^L.Li i aana 

DYCRLLCA\YILGNDFTDLFDIVITNALKPGFF 

SHLPSQRPFKTLENDEEQEALPSLDKPGWYSQ 

GNAVHLYELLKKMTGKPEPKWYFGDSMHS 

DIFPARHYSNWETVLILEELRGDEGTRSQRPE 

ESEPLEKKGKYEGPICAKPLNTSSKKWGSFFU 

DSVLGLENTEDSLVYTWSCKRISTYSTIAIPSI 

EAIAELPLDYKFTRFSSSNSKTAGYYPNPPLV 

LSSDETLISK 


856 


2206 


A 


7082 


396 


1635 


SSPSVFEFEHAVQPVFTMEFLKTCVLRKKACT 

AVCFWRSKVVQKPSVRR1STTSPRSTVMPAW 

VIDKYGKNEVLRFTQNMMMPIIHYPNEVIVK 

VHAASVNPIDVNMRSGYGATALNMKRDPLH 

VKIKGEEFPLTLGRDVSGVVMECGLDVKYFK 

PGDEVWAAVPPWKQGTLSEFVVVSGNkvbH 

KPKSLTHTQAASLPYVALTAWSAINKVGGLN 

DKNCTGKRVLILGASGGVGTFAIQVMKAWD 

AHVTAVCSQDASELVRKLGADDVIDYKSGSV 

EEQLKSLKPFDF1LDNVGGSTETWAPDFLKK 

WSGATYVTLVTPFLLNMDRLGIADGMLQTG 

VTVGSKALKHFWKGVHYRWAFFMASGPCL 

DD1AELVDAGKIRPVMEQTFPFSKVPEAFLKV 

ERGHARGKTVINVV 


857 


2207 


A 


7088 


320 


2417 


LRRRKMTPQSLLQTTLFLLSLLFLVQGAHGR 

GHREDFRFCSQRNQTHRSSLHYKPTPDLRISIE 

NSEEALTVHAPFPAAHPASRSFPDPRGLYHFC 

LYWNRHAGRLHLLYGKRDFLLSDKASSLLCF 

QHQEESLAQGPPLLATSVTSWWSPQNISLPSA 

ASFTFSFHSPPHTGAHNASVDMCELKRDLQL 

LSQFLKHPQKASRRPSAAPASQQLQSLESKLT 

SVRFMGDMGSFEEDRINATVWKLQPTAGLQ 

DLHIHSRQEEEQSE1MEYSVLLPRTLFQRTKG 

RSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 

KVLGIVVQNTKVANLTEPWLTFQHQLQPKN 

VTLQCVFWVEDPTLSSPGHWSSAGCETVRRE 

TQTSCFCNHLTYFAVLMVSSVEVDAVHKHY 

L SLL S Y VG C W S AL ACL VTI AAYLC SRVPLPC 

RRKPRDYTIKVHMNLLL AVr LLD 1 bbLLafcr v 

ALTGSEAGCRASAIFLHFSLLTCLSWMGLEG 

YNLYRLVVEVFGTYVPGYLLKLSAMGWGFPI 

FLVTLVALVDVDN YGrllLA VhKl rtu v 1 1 ra 

MCWIRDSLVSYITNLGLFSLVFLFNMAMLAT 

MVVQILRLRPHTQKWSHVLTLLCLSLVLGXLP 

WALIFFSFASGTFQLWLYLFSIITSFQGFLIFI 

1x^xr\T^c^/mT r\ A D/lfTDQ'PT K"^KF^n^ A"R T PTSSGS 
WYWoMl^yAKuur or Lrvin ouomu^r ioovjo 

TSSSRI 


858 


2208 


A 


7091 


185 


415 


DAGA VKSSDTN1 WFRGMCDDKKGHRCP^ U 

QPQHFHVAFHTEAEGAMFYFRLHV1HRVMQS 

OOQLFPSTLFSWLLE 


859 


2209 


A 


7136 


3 


302 


FFFWRQSLALLPRLECSGATGAHCNLHFFUSS 
DCPTSAS*1AGITGACYHAWLLFVFLAETGFH 
HVGQGGLELLTSSDPSGSASQSAGITGVSHCT 
WPT 


860 


2210 


A 


7156 


23 


591 


ALSTETRTPDMRRLLLVTSLVVVLLWEAGAV 
PAPKVPIKMQVKHWPSEQDPEKAWGARWE 
PPEKDDQL V VLFP V QKPKLLTTEEKPRG QGR 
GPILPGTKAWMETEDTLGRVLJPEPDHDSLY 
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MSDOCID <WO 0157188A2_I_> 



WO 01/57188 




T/US01/03800 



SEQ ID 

nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

V-\ . -w » nn inn 

Dcginning 
nucleotide 
location 

Of irrt*^nmifl i 

vUl 1 toJJUJ 1U 1 

ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 

In Ifi^t amino 

IU Idol allllllLT 

acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, 
i^^/Yspanic /vcia, ii^vjiu ramie /vciu, 
F=Phenylalanine, G=Glycine. H=Histidine, 
I=IsoIcucine, K=Lysine, L=Leucinc, 

lvi iv IC-IUIUIUIIV, 1> AipdldglllC, J JTIUIUIC, 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, W=Tryptophan. 
Y— Tyrosine X=Unknown *=Ston codon 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














HPPPEEDQGEERPRLWVMPNHQVLLGPEEDQ 
DHIYHPQ* GSRGHHCPRPVPRPFLLLGLGPSLP 
CPS 


861 


2211 


A 


7161 


1220 


1003 


NYVCTIAF*EKKMGF*LSLSCLVI I FVI FI DPT 

LTTTTRIMFHCTYLFASVCLSLLNTLLSPNCL 

KSAMILQ 


862 


2212 


A 


7211 


665 


847 


LKYYHITMGIYKTGKKVIL^KSSMSNRFSVIF 
YKNIQKXSFSNYVYHQNYVFSSDWSYDF 


863 


2213 


A 


7212 


924 


1273 


HGSSCALGDLAPG*LPSGPVLSSPAVRL*RKP 
LV\VDSPSCLPATGPT*GLVLVLGGPDCT*WA 
RGQI^HKRN<fRAP*SCRVTWLAKKKKKTDQ 
CIKPNYQSPPKECDYNILANSVA 


864 


2214 


A 






io iy 


DPKVP VDADHVQGQDPGRAAI-IDII IGED VTE 
v'vctrpvpT a PnFvnnTnp^Hnp hhud pynnD 

Iv V OL\*LJrL^f\trlJlL V \JU 1 J^JDOMJ^JtvHOiiXvD V Vj l^K 

HGHDQEEVAYEERACEGGKFATVEVTDKPV 
DEALREAMPKVAKYAGGTNDKGIGMGMTV 
PISFAVFPNEDGSLQKKLKVWFRIPNQFQSDP 
PAPSDKS VKIEEREGIT VYSMQFGG YAKE AD 
YV AQATRLRAALEGTATYRGDI YFCTG YDPP 
MKPYGRRNEIWLLKT 


865 


2215 


A 


7246 


559 


682 


RRLGAVAHAYTSSTLGGRGGWTT*GQELQTS 

I A "MM A ft*" PPT V 


866 


2216 


A 


7257 


641 


1310 


TCTYKYLMGWIRGRRSRHSWEMSEFHNYNL 
DLKKSDFSTRWQKQRCPVVKSKCRENASPFF 

PPrFTA V A N./ff~iTl?"FITN/r\/ A I\X/<3A\/PT "MQT CMfYCW 

rLLrl/\ v AivioiivxilIVl VAJ WoA V rLfNoJLr N^bv 
QIPLTESYCGPCPKNWICYKNNCYQFFDESKN 
WYESQASCMSQNASLLKVYSKEDQDLLKLV 

rvo I n w ivivji_ v nil i inuo w w LLL/UolL/Dr !\LL l 

IIEMQKGDCALYASSFKGYIENCSTPNTYICM 
QRTV 


867 


2217 


A 


7288 


151 


396 


SIKIIEAFGSNGPDF WFFRYWSP* LFRQQ V VFI 
MPFFQTLWL MN ANRFC S I FTTTN V ANN C W W 
TPYHCWLSVVVCRCESHGI 


868 


2218 


A 


7298 


3 


272 


PDTVIGGRGSGGKEFGRWVLW*VFE*RLGTP 
KGSCPAGGSRMVSESD*EGRGC*ASYPCAC* 
AGS* WR* GSRPAGRGTPPRSLSHARPP 


869 


2219 


A 


7332 


1223 


332 . 


PRRDAEDRDESCLNPAFPIGLLHPNSVNSMAJR 

YRLVl^ADns^LACVMECEGKLPSLKIWETC 

KELLQLSKPELPQDGTSTLRENSKPEESHLLA 

KRYGGFMI<RYGGFN4KICMDELYPMEPEEEA 

NGSE1LAKRYGGFMKKDAEEDDSLANSSDLL 

KELLETGDNRERSHHQDGSDNEEEVSKRYGG 

FMRGLKJRSPQLKEICAI^LQK^YGGFMRRVG 

PQKW*MTSPQNRYGGFLKRFAEALPSDEEGE 

SYSK£VPEMEKRYGGFIV1RF 


870 


2220 


A 
AY 




— 1 U 


lulu 


AGQGRGREGAESGGSRGEGPGSDGRLPATGD 
FWSPRSQRRGCCGRRAPRPEAMENGA V YSPT 
TEEDPGPARGPRSGLAAYFFMGRLPI I RRV1 
KGLQLLLSLLAFICEEVVSQCTLCGGLYFFEF 
VSCS AFLLSLLILI V YCTPFYERVDTTKVKS SD 
FYITLGTGCVFLLASIIFVSTHDRTSAEIAAIVF 
GFIASFMFLLDF1TMLYEKRQESQLRKPENTT 
RAE ALTEPL N A 


871 


2221 


A 


7403 


3 


393 


SCAMCSGLL*LLLPIWLSWTLGTRGSEPRSVN 
DPGNMSFVKETVDKLLTGFRCFREREAAPRR 
ALRGAALPGESEAGDPESLRSSVNADWIQYS 
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SOOCtD: <WO 0157168A2J_> 



WO 01/57188 




PCT/U SOI /038(H) 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ TD 
MO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding, 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C^Cysteine, 

a ennrtic Acid T 7 =CiiutamiC Acid. 

F=Phenylalanine, G=Glycine, H=Histidine, 
l^T^oleiicine K=Lv*5ine L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine 5 S=Serine, 
T^hreonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown 5 *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














DLWEAEVSTPRCEAGFCQECFRTPGNQEKDG 
PFIC 


872 

i 


2222 


A 


7413 


1061 


359 


FVDIVSVVEFPHCPEARFPAQHGQDSKRJLTLC 

PGGS* PQATLHLDRMRVSASPTKEIQ VKKYK 

CGLDCPCPANYFAFKJCSGAANVVGPTMCFED 

RMIMSPVKNNVGRGLNIALVNGTTGAVLGQ 

KAFDMYSGDVMHLVKFLKE1PGGALVLVAS 

YDDPGTKMNDESRKLFSDLGSSYAKQLGFRD 

SWVFIGAKDLRGKSPFEQFLKEQPQTQNKYE 

GWPELLEMEGCMPPKPF 


873 


2223 


A 


7429 


2242 


2394 


" ILKCAGHGGSCL* SQHFGRLRWEDRLRLG VQ 
nHPHOT-rrFTPST I KJERKLF 


874 


2224 


A 


7468 


146 


894 


PCTSCVLWA'lLHLPASTRKAPQAECGMlSrrE 
W OK 1 n VGTTGFGIFFILFGTLL YFD S VLL AFGN 
LLFLTGLSLIIGLRKTFWFFFQRHKLKGTSFLL 
GGVVIVLLRWPLLGMFLETYGFFSLFKGFFPV 
AFGFLGNVCNIPFLGALFRRLQGTSSMV*KTE 
M S SLNLDHWLKG AKREE WEPPPQ SP ALTH SP 
TYPGPPQVQKERNGAHQJLTSNPQVDSRGCQE 
AEMQTPRJU,GWGWYHTLTLYLWEEK 


875 


2225 


A 


7498 


91 


251 


GEKPVPTWLQDEAGQWLLGFVAQPWGWPG 
SERHEP * HG G VLFRLGPS APPGKL 


876 


2226 


A 


7544 


403 


587 


YSCLCFLFKHITSFKNSVHIWLGTVVHAYNPN 
1LGGQGGWIA*GQEFKTSLGNTVRPCLYK 


877 


2227 


A 


7566 


2 


940 


GCAPDTRFFVPEPGGRGAAPWVALVARGGC 

TUVTWMl VA ARRNA^AWl YNFERYGN1TLP 

MSHAGTGN1VVIMISYPKGREILELVQKGIPV 

TMTIGVGTRHVQEFISGQSVVFVAIAFITMM1I 

SLAWLIFYYIQRFLYTGSQIGSQSHRKETKKV1 

GQLLLHTVKHGEKGIDVDAE'NCAVCIENFKV 

KDIIRILPCKHIFHRICIDPWLLDHRTCPMCKL 

DVIKALGYWGEPGDVQEMPAPESPPGRDPAA 

NLSLALPDDDGSDESSPPSASPAESEPQCDPSF 

KGDAGENTALLEAGRSDSRHGGPIS 


878 


222S 


A 


7586 


315 


1232 


' ERSLLCKVD VR WI Y V SEGTKTQRRHRQG SLR 
T?nuuAAAPWYVT FT 1 OPTVYLVTCANLTNG 
ok^fi T KSGSSKSTLKI-IIWTESSKDLSISRLLS 
QTFRGKENDTDLDLRYDTPEPYSEQDLWDW 
LRN STDLQEPRPRAKRRPIVKTGKFKKMFGW 
GDFHSNIKTVKI.NLLITGKTVDHGNGTFSVYF 
RHNSTGQGNVSVSLVPPTKJVEFDLAQQTVID 
AKDSKSFNCRIEYEKVDKATKNTLCNYDPSK 
TCYQEQTQSHVSWLCSKPFKVICIYISFYSTD 
YKLVQKVCPDYNYHSDTPYFPSG 


879 


! 2229 


A 


7605 


479 


39/ 


' TESWKLKWWSPTCLDQLNGSAPGNVFIHG 


880 


2230 


A 


7612 


93 


659 


DAAVAMTAQGGLVANRGRRFKWA1ELSGPG 

GGSRGRSDRGSGQGDSLYPVGYLDKQVPDTS 

VQETDRILVEKJICWD1AJLGPLKQ1PMNLFIMY 

MAGNTISIFPTMMVCMMAWRPIQALMAISAT 

FKMLESSSQKFLQGLVYLIGNLMGLALAVYK 

CQSMGLLPTHASDWLAFIEPPERMEFSGGGL 

LL 


881 


2231 


A 


7615 


291 


1452 


" SPQKT^SHTITMTTTSVSSWPYSSFIRMRFIT 
NHSDQPPQNFSATPNVTTCPMDEKLLSTVLTT 
SYSVIFIVGLVGNI1ALYVFLGIFIRKRNSIQIYL 
LNVAIADJXLIFCLPFRIJVIYHINQNKWTLGVIL 
CKVVGTLFYMNMYIS11LLGF1SLDRY1KINRS1 
QQRKAITTKQSIYVCCIVWMLALGGFLTM11L 
TLKKGGHNSTMCFHYRDKHNAKGLAIFNFIL 
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^SDOCID- <WO 0157168A2_I_> 



WO 01/57188 




T/US01/03800 



SEQID 

"NO- nf 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

NO- nf 

peptide 
seq- 
uence 


Met 


SEQ 

TD T'JO- 
in 

USSN 
09/496 
914 


Predicted 
beginning 
.nucleotide 
location 
correspond i 
ng to first 

ttillllJU dL-lU 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
Hi pepuue 
sequence 


Amino acid sequence (A= Alanine C=Cysteine, 
D = Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=IIistidine s 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i — i nreonine, v— valine, w— irypiopnan, 
Y^Tyrosine, X=Unknown. *=Stop codon, 
/=possiblc nucleotide deletion, \=possible 
nucleotide insertion 














SGKYA1TARNSF1VLIIFTICFVPYHAFRPIYISS 

v^tjiN »oovj i vv xsjz»i v n.iv i in H/ii vi_i_, vLoor in o v . i_x Jt 

VMYFLMSSNIRKIMCQLLFRRFQGEPSRSEST 
SEFKPGYSLHDTSVAVKIQSSSKST 


S82 


2232 


A 


7617 


67 


379 


ROMA 7 I k' A KJlCVil 1Q. Adl K FFQVT I f~1 \ /'T7W Pi 
iv\^/lvi/Al_<l_,rw-\.l\l rvJ_'J— «1 o/\.OJL^rvXLi o VIOLIN v^i^ V I IN LJ 

PLVSEEDMVTVVEDWMNFYINYYRQQVTGE 
PQERDKALQELRQELNTLANPFLAKYRDFLK 

copy P^HPPP^ 


883 


2233 


A 


7622 


400 


215 


KVKTCRYNPKYSAANDTGFVDIPSREKDLAK 
AVATVGPISVAVGASHVFFQFYKXGKHLSS 


884 


2234 


A 


7638 


2640 


2S61 


APVLILQMVKLSIVLTPQFLSHDQGQLTKELQ 
QHVKSVTCPCEYLRKVSECRQMGPGALEQFP 
GLSCHTSHSG 


OOJ 




A 

r\. 


/ D'ti 


7ft1 


HDD 


rbKOlSJVlJbi^c.AJVloKY 1 brVIMr AVrrrlL. 1 V VLLr 
AIGMFFTAWFFVYEVTSTKYTRDIYKJELLISL 
VASLFMGFGVLFLLLWVG1YV 


OOD 


--JO 


A 




0 1 


Doy 


AFtNri' i>KQHr Nob 1 K VKLbLK J OI WjLGNHA 

HLG EHFSTHHELGLSGK V VGFL VKNILE VIRN 

GGMETRHPGKVSSWFHRWDSRAEQHNHAE 

HHEDVPQGDEDSKVSEAQQEFPDWTCAGLP 

GLLPKALRVLLFQLKVQHRPGIHQQRPEQQD 

VSDHRYGRSVRQNRK 


887 


2237 


A 


7693 


85 


315 


NPGCCLPVAMRTSYLLLFTLCLJLLSEMASGG 

NFLTGLGHRSDHYNCVSSGGQCLYSACPIFTK 

IQGTCYRGKAKCCK 


888 


223S 


A 


7702 


242 


1298 


APSHRRRYLSPSRSAGQLGNMALERLCSVLK 

VLLITVLVVEGIAVAQKTQDGQNIGIKHIPAT 

QCGIWVRTSNGGHFASPNYPDSYPPNKECIYI 

LLAAJ Kt^KJbL 1 rJDEIIY YlbTSFECRr DHLEVR 

DGPFGFSPLIDRYCGVKSPPLIRSTGRFN4WIKF 

SSDEELEGLGFRAKYSFIPDPDFTYLGGILNPIP 

DCQFELSGADG IVRSSQVEQEEKTKPGQAVD 

CIWTIKATPKAKIYLRFLDYQMEHSNECKRNF 

VAVYDGSSSIENLKAKJFCSTVANDVMLKTGI 

G VIRJVI WADEGSRLNRFRMLFTSFGGASPAQA 

ALSFCHSNMCINNSLVCNGVQNCAYPWDEN 

HC 


8S9 


2239 


A 


7707 


185 


2911 


CHYJMNPSTHHPASAGGSILGLFDFFGLGLGE 

MTMDALLARLKLLNPDDLREEIVKAGLKCGP 

ITS TT RFI FEKKL AQ ALLEQG GRL S SF YHHE A 

GVTALSQDPQRILKPAEGNPTDQAGFSEDRDF 

GYSVGLNPPEEEAVTSKTCSVPPSDTDTYRAG 

ATASKEPPLYYGVCPVYED\TARNERIYVYE 

NKK£ALQAVKMJLK.GSRFKAFSTREDAEICFAR 

GICDYFPSPSKTSLPLSPVKTAPLFSNDRLKDG 

LCLSESETVNKERANSYKNPRTQDLTAKLRK 

AVEKGEEDTFSDLIWSNPRYLIGSGDNPTIVQ 

EGCRYNVMHVAAKENQASICQLTLDVLENP 

DFMRLMYPDDDEAMLQKRIRYVVDLYLNTP 

VKNSRNKYDKTPEDVICERSKNKS\^ELKERIR 
EYLKGHYYVPLLRAEETSSPVIGELWSPDQTA 
EASHVSRYGGSPRDPVLTLRAFAGPLSPAKAE 
DFRKLWfCTPPREKAGFLHHVKKSDPERGFER 
VGREL AHELG YP W VE YWEFLG CFVDLSSQE 
GLQRLEEYLTQQEIGKKAQQETGEREASCRD 
KATTSGSNSISVRAFLDEDDMSLEEIKNRQNA 
ARNN SPPT VG AFGHTRCS AFPLEQE ADLIE AA 
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nucl- 
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seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
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Predicted 

beginning 

nucleotide 

location 

eorrespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end j 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D^Aspartic Acid, h- (jiuiarnic /vcia, 
F=Phcnylalaninc, G=G)ycinc, H=Histidinc } 
I=Isoleucine 5 K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proiine, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion. \=possible 
nucleotide insertion _j 
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EPGGPHSSRNGLCHPLNHSRTLAGKRPKAPR 

GEEAHLPPVSDLTVEFDKLNLQN1GRSVSKTP 

DESTKTKDQILTSRINAVERDLLEPSPADQLG 

NGHRRTESEMSAR1AKMSLSPSSPRHEDQLEV 

TREPARRLFLFGEEPSKLDQDVLAALECADV 

DPHQFPAVHRWKSAVLCYSPSDRQSWPSPAV 

KGRFKSQLPDLSGPHSYSPGRNSVAGSNPAKP 

GLGSPGRYSPVHGSQLRRMARLAELAAL 

RHMPVIPALWEAEVGGLLEPRSSRSAWATE 


890 
891 


2240 
2241 


A 
A 


7711 
7721 


360 
61 


1175 


KLPWEPSFLIKMQIIRHSEQTLKTALISKNPVL 

VSQYEKLDAGEQRLMNEAFQPASDLFGPITL 

HSPSDWITSHPEAPQDFEQFFSDPYRKTPSPN 

KRSIYIQSIGSLGN 1 Kliabbiti^wi^iu i r 

YGLRVKLLEPVPVSVTRCSFRVNENTHNLQIH 

AGDILKFLKKKKPEDAFCVVGITMIDLYPRDS 

WNFVFGQASLTDGVG1FSFARYGSDFYSMHY 

KGKVKKLKKTSSSDYSIFDNYYIPEITSVLLLR 

SCKTLTHEIGHIFGLRHCQWLACLMQGSNHL 

EEADRRPLNLCPICLHKLQCAVGFSIVERYKA 

L VRWIDDES SDTPG ATPEHSHEDNGNLPKPV 

EAFKEWKEWIIKCLAVLQK 


892 


2242 


A 


7723 


2 


1650 


SAPTAPARPCRAERGSGGGMLALLAASVALA 

VAAGAQDSPAPGSRFVCTALPPEAVHAGCPL 

PAMPMQGGAQSPEEELRAAVLQLRETVVQQ 

KETLASARA1RELTGKLARCEGLAGGKARGA 

GATGKDTMGDLPRDPGHVVEQLSRSLQTLK 

DRLESLEPLPAMPMQGGAQSPEEELRAAVLQ 

LRETVVQQKETLASARA1RELTGKLARCEGL 

AGGKARGAGATGKDTMGDLPRDPGHWEQ 

LSRSLQTLKDRLESLEHQLRANVSNAGLPGD 

FR£VLQQRLGhLERQELI<J5LOAiiLiiUtivaJLi^ri 

NETSAHRQKTESTLNALLQRVTELERGNSAF 

KSPNAFKVSLPLRTNYLYGKIKKTLPELYAFT 

ICLWLRSSASPGMGTPFSYAVPGQANEIVLIE 

WGNNPIELLINDKVAQLPLFVSDGKW^HHICV 

TWTTRDGMWEAFQDGIOCLGTGENLAPWHP1 

KPGGVLILGQEQDTVGGRFDATQAFVGELSQ 

FNIWDRVLRAQE1VNIANCSTNN4PGNIIPWVD 

NNVD VFG G ASK WP VETCEERLLDL 


893 


2243 


A 


7729 


3554 


2419 


LTAGTAMNYPLTLEMDLENLEDLFWELDRL 

DNYNDTSLVENHLCPATEGPLMASFKAVFVP 

VAYSLIFLLGVIGNVLVLVILERHRQTRSSTET 

FLFHLAVAULLL Vribrr A v/viioo vuw vLuir 

LCKTVIALHKVNFYCSSLLLAC1AVDRYLAJV 

HAVHAYRHRRLLSIH1TCGTIWLVGFLLALPEI 

LFAKVSQGHHNNSLPRCTFSQENQAETHAWF 

TSRFLYHVAGFLLPMLVMGWCYVGVVHRLR 

IFLDTLARLKAVDNTCKLNGSLPVAITMCEFL 
GLAHCCLNPMLYTFAGVKFRSDLSRLLTKLG 
CTGPASLCQLFPSWRRSSLSESENATSLTTF 


894 


2244 


A 


7738 


670 


287 


FVTRAGRWGAGARVRGGAGGMASGAARWL 
VLAPVRSGALRSGPSLRKDGDVSAAWSGSGR 
SLVPSRSVIVTRSGAILPKPVI<^SFGLLRVFSI 
VIPFLYVGTLISKNFAALLEEIIDIFVPEDDDDD 

D 


895 
896 


2245 
2246 


A 
A 


7753 
7754 


119 
1 


278 
372 


APYAHSQVHCLDKVCGLLPFLNPEVPDQFYR 

LWLSLFLHAGKEAPHCPRTRPL 

SPAWWNSQQRVVSPFLALLTLEPTFHHLLP.M 
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seq- 
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SEQ ID 
NO: of 
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seq- 
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hod 
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in 
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u^^i lining 

nucleotide 
location 
corresnondi 
ng to first 
amino acid 
residue of 
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sequence 


Predicted end 

nui-ieuiiuc 

location 
corresponding 

fn la^t amino 

acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

naApnarti/' Ar»irl T7— ^3l»it»n-mi^» A 

LJ — ■A.SpaillC' /\C11J, Il = \jIUlaITllC /\C]a 3 

F«PhenyIaIanine, G=G lycine, H=Histidine 3 
I=lsolcucine, K=Lysine, L=Leucine, 

N/l^N/ftMhionine T\I=A^nnr»crinf» P=Prnlinr» 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan ; 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possib!e nucleotide deletion, V=possibie 
nucleotide insertion 














QVSTAALAVLLCTMAI..CNQVLSAPLAADTPT 
ACCFSYTSRQIPQNFIADYFE'rSSQCSKPSVlFL 
TfCRGRQVCADPSEEWVQKYVSDLELSA 


897 


2247 


A 


7761 


1725 


445 


RPRRRGTHIIFSCVLG SFR VS AMFPR V STFLPL 
RPLSRHPLSSGSPETSAAAIMLLTVRHGTVRY 
RSSALLARTKNN1QRYFGTNSV1CSKICDKQSV 
RTEETSKETSESQDSEKENTK1CDLLGHKG MX 
VEL ST VN VRTTKPPKRRPLK SLE ATL GRLRRA 

TpV A T>IS"L~ Y> It?DT ODCT \/ A A A O A \7 A T~\OT nmi/A 
Iti /^N.JvKlill'JLirt.L V AMoA V AUoLrl L>K.Q 

TTKSELLSQLQQI-IEEESRAQRDAKRPKISFSN1 
ISDMKVARSATARVRSRPELR1 QFDEG YDNYP 
GQEKTDDLKKRKNIFTGKRLNIFDMMAVTKE 
APETDTSPSL WD VEF AKQL AT VN EQPLQN GF 
PPT TOWTfcTPf?lc T WPPPTAJTvjf a r;Fnnnr:cFFu 

EHIFLEKHLESFPKQGPIRHFMELVTCGLSKNP 
YLSVKQKVEHIEWFRNYFNEKKD1LKESNIQF 
KLRPWKFLFRNN 


898 


2248 


A 


7775 * 


85 


496 


SCQTTQPPAQSCSTGTMRIMLLFTAILAFSLA 
QSFGAVCKEPQEEVVPGGGRSKRDPDLYQLL 

RDMHDFFVGLMGKRSVQPDSPTDVNQENVP 
SFGILKYPPRAE 


899 


2249 


A 


7785 


179 


703 


PFHLGASSNTFRLQVQTQESKAQKEVKMGFI 

FSKSMNESMKNQKEFMLMNARLQLERQLIM 

QSEMRERQMAMQIAWSREFLKYFGTFFGLA 

AISLTAGAIKKKKPAFLVPIVPLSF1LTYQYDL 

GYGTLLERMKGEAEDILETEKSKLQLPRGM1T 

FESIEKARKEQSRFFIDK 


900 


2250 


A 


7789 


1465 


300 


VWLPLKSYKiRSPSLHCQCEIFREEFLFSSLQE 
GKX)KDTFSKMAMVSEFLKQAWFIENEEQEY 
VQTVKSSKGGPGSAVSPYPTFNPSSDVAALH 
KAIMVKGVDEATIIDILTKRNNAQRQQIKAAY 

AQFDADELRAAMKGLGTDEDTLIEILASRTN 

KEIRDINRVYREELKRDLAKDITSDTSGDFRN 

ALLSLAKGDRSEDFGVNEDLADSDARALYEA 

GERRKGTDWVFNTILTTRSYPQLRRYFQKY 

TKYSKHDMNKVLDLELKGDIEKCLTAIVKCA 

TSKPAFFAEKLHQAMKGVGTRHKALIRIMVS 

RSEIDMNDIKAFYQKMYGISLCQAILDETKGD 

YEKILVALCGGN 


901 


2251 


A 


7796 


2 


807 


VPPUPDR AP Ap r AP APQUP.VT T TAPTI T Ql \/T 

ALLFPSMASMAAIGSCSKEYRVLLGQLQKQT 

DLMQDTSRLLDPYIRIQGLDVPKLREHCRERP 

GAFPSEETLRGLGRRCFLQTLNATLGCVLHRL 

ADLEQRLPKAQDLERSGLNIEDLEKLQMARP 

NILGLRNNIYCMAQLLDNSDTAEPTKAGRGA 

SQPPTPTPASDAFQRKLEGCRFLHGYHRFMH 

S VGRVFSK WGE SPNRSRRHSPHQALRKG VRR 

TRPSRKGKRLMTRGQLPR 


902 


2252 


A 


7802 

i 


2 


721 


TA ARRRQKGTA ARRLQKGTAARRRQKGTAA 
RRRQKGTAARRPQKG'FAARRRQKGTAARRR 
QKGTAAJIRRQICGTAARRPQK.GTAARRRQKG 
TAARRRQKGTAARRRQKGL AI ASRGCPC ASR 
AGGVRGAGSRLRAMAPKVFRQYWDIPDGTD 
CHRKAYSTTSIASVAGLTAAAYRVTLNPPGTF 
LEGVAKVGQYTFTAAAVGAVFGLTTCISAHV 
REKPDDPLNYFLGGCAGGLTLGARTHNYGIG 
AAACVYFGIAASLVKMGRLEGWEVFAKPKV 
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SEQ ID 1 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, Jb— oiuiamic aciu, 
F=Phenylalanine, G=Glycine, H=Histidinc a 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine :i N=Asparagine, P^ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown ; *=Stop codon, 

J — nr»ociKlA rmrl<=*ntirl*=» HeletifYTI \=nr)SStble 

nucleotide insertion 


903 


2253 


A 


7807 


1 


584 


PWLPWSDGRAARSSRKCPRSRFPVQVGKMA 

VSTVFSTSSLMLALSRHSLLSPLLSVTSFRRFY 

RGDSPTDSQKDMIEIPLPPWQERTDESIETKR 

ARLLYESRKRGMLENCILLSLFAKEHLQHMT 

EKQLNLYDRLINEPSNDWDIYYWATEAKPAP 

EIFENEVMALLRDFAKNKNKEQRLRAPDLEY 

LFEKPR 


904 


2254 


A 


7813 


40 


821 


" r> a i^T> at nisi CTP a Pl'PiYf A A At PATCTxT FPR9T T O 

AGARLTGWTMNVFRILGDLSHLLAMILLLGK 
IWRSKCCKGISGKSQILFALVFTTRYLDLFTNF 
1 S I YNT VMKV VFLLC A Y YT VYMI YGKFRKTF 
DSENDTFRI.EFLLVPVTGLSFLENYSFTLLEIL 
WTFSIYLESVAILPQLFMISKTGEAET1TTHYL 

T7T7T /" T VD AT VI ATvTWTT? P VnTFISjPVDOT A VV^J 

GVVQTIFYCDFFYLYVTKGRSWDDSNADTGL 
RSYSSI 


905 


2255 


A 


7817 


1399 


881 


LSNKDVLSPQLKDENSKLRRKLNEVQSFSEA 

QTEMVRTLERKLEAKMIKEESDYHDLESVVQ 

QVEQNLELMTKRAVKAENHVVKLICQEISLL 

QAQVSNFQRENEALRCGQGASLTWKQNAD 

VALQNLRVVMNSAQASIEQLVSGAETLNLVA 

EILKSIDR1SEVKDEEEDS 


906 


2256 


A 


7822 


3 


1462 


DSPRNRFEILGRPTRTPTRPGPRPAMEDLDAr . 

LSDLETTTSHMPRSGAPKERPAEPLTPPPSYG 

HQPQTGSGESSGASGDKDHLYSTVCKPRSPK 

PAAPAAPPFSSSSGVLGTGLCELDRLLQELNA 

TQFNITDEIMSQFPSSKVASGEQKEDQSEDKK 

RPSLPSSPSPGLPKASATSATLELDRLMASLSD 

FRVQNHLPASGPTQPPVVSSTNEGSPSPPEPTG 

KGSLDTMLGLLQSDL.bKK(jr Vr 1 l^AKULLUoL 

NKPIAGQVVTALGRAWHPEHFVCGGCSTAL 

RHKMVTALGTHWHPEHFCCVSCGEPFGDEG 

FHEREGRPYCRRDFLQLFAPRCQGCQGPILDN 

YISALSALWHPDCFVCRECFAPFSGGSFFEHE 

GRPLCENHFHARRGSLCATCGLPVTGRCVSA 

LGRRFHPDHFTCTFCLRPLTKGSFQERAGKPY 

CQPCFLKLFG 


907 


2257 


A 


7828 


1792 


1671 


FIYVNQSFAPSPDQEVGTLYECFGSDGKLVLH 
YCKSQAWG 


908 


2258 


A 


7842 


110 


1172 


KLSCPCSHGTRVTAVRGPRLKAGVQWHDLG 
SLQPPPSGLKQSSHLSLSSSWDFRHAPTHPET 
YTCPKMIEMEQAEAQLAELDLLASMFPGENE 

NMNLDVSDEKMAMFSLACILPFKYPAVLPEI 

T\/T>C\/1 T CPQnnTHT XTTni TAFT Ok'T-TPHrrDV 
1 V rvo V LLroIxov^V^ 1 V^J-»JN i UL Irtr i^i^rvriv^ri^ji^ v 

CILNATEWVREHASGYVSRDTSSSPTTGSTVQ 
SVDLIFTRLWIYSHHIYNKCKRKNILEWAKEL 
SLSGFSMPGKPGWCVEGPQSACEEFWARLR 
KI.NnVKRILIRIiREDIPFDGTNDETERQRKFSIF 
EEKVFS VNGARGNHMDFGQL YQFLNTKG CG 
DVFQMFLWV 


909 


2259 


A 


7870 


3067 


2923 


EGICVYTFIYVHMYTRTCMHTYPYMYMN S V 
LI S SEILLIPSKYLFE SK 


910 


2260 


A 


7884 


212 


4874 


GALTWSHPLLAVCPQGVWLGSTPSGSPALLP 
PSHRVNAEPGCVVTNACASGPCPPHANCRDL 
WQTFSCTCQPGYYGPGCVDACLLNPCQNQG 
SCRHLPGAPHGYTCDCVGGYFGHHCEHRMD 
QQCPRGWWGSPTCGPCNCDVHKGFDPNCNK 
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SEQ ID 

JNU. 01 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

peptide 
seq- 
uence 


Met 
nuu 


SEQ 

l VJ IN \J . 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino • 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine C=Cysteine 3 

L^ASpuTllC /\C1Q. E» — VJIUltuTllC /\cio 3 

F=Phenylalanine 5 G=Glycine, IWIistidine, 
l^Isoleucine, K=Lysine, L=Leucine 5 
M=Methionine, N=Asparagine ) P= Pro line, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, \V=Tryptophan, 
Y=Tyrosine, X^Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














TNGQCHCKEFHYRPRGSDSCLPCDCYPVGST 

SRSCAPHSGQCPCRPGALGRQCNSCDSPFAEV 

TASGCRVLYDACPKSLRSGVWWPQTKFGVL 

ATVPCPRG ALGLRGAGAAVRLCDEAQG WLE 

PDLFNCTSPAFRELSLLLDGLELNKTALDTME 

AKKLAQRLREVTGHTDHYFSQDVRVTARLL 

AHLLAFESHQQGFGLTATQDAHFNENLLWA 

GSALLAPETGDL WAALGQRAPGG SPG SAGL V 

RHLEEYAATLARNMELTYLNPMGLVTPN1ML 

SIDRMEHPSSPRGARRYPRYHSNLFRGQDAW 

DPHTHVLLPSQSPRPSPSEVLPTSSSIENSTTSS 

WPPPAPPEPEPGISITILLVYRTLGGLLPAQFQ 

AERRGARLPQNP VMNSP V VS VA VFH GRNFLR 

GILESPISLEFRLLQTANRSKAICVQWDPPGLA 

EQHGVWTARDCELVHRNGSHARCRCSRTGT 

FGVLMDASPRERLEGDLELLAVFTHWVAVS 

VAALVLTAAILLSLRSLKSNVRGIHANVAAA 

LGVAELLFLLGTHRTHNQLVCTAWILLHYFF 

LSTFAWLFVQGLHLYRMQVEPRNVDRGAMR 

FYHALGWGVPAVLLGLAVGLDPEGYGNPDF 

CVVISVHEPUWSFAGPVVLVIVMNGTMFLLA 

ARTSCSTGQREAKKTSALTLRSSFLLLLLVSA 

SWLFGLLAVNHSILAFHYLHAGLCGLQGLAV 

LLLFCVLNADARAAWMPACLGRKAAPEEAR 

PAPGLGPGAYNNTALFEESGLIR1TLGASTVSS 

VSSARSGRTQDQDSQRGRSYLRDNVLVRHGS 

AADHTDHSLQAHAGPTDLDVAMFHRDAGA 

DSDSDSDLSLEEERSLSIPSSESEDNGRTRGRF 

QRPLCRAAQSERJLLTHPKDVDGNDLLSYWPA 

LGECE AAPCALQT WG SERRLGLDTSKDAAN 

NNQPDPALTSGDETSLGRAQRQRKGILKNRL 

QYPLVPQTRGAPELSWCRAATLGHRAVPAAS 

YGRIYAGGGTGSLSQPASRYSSREQLDLLLRR 

QLSRERLEEAPAPVLRPLSRPGSQECMDAAPG 

RJLEPKDRGSTLPRRQPPRDYPGAMAGRFGSR 

DALDLGAPREWLSTLPPPRRTRDLDPQPPPLP 

VPSRHPSREALGPLPQ1XRAREDSVSGPSHGP 

STEQLDILSSILASFNSSALSSVQSSSTPLGPHT 

TATPSATASVLGPSTPRSATSHSISELSPDSEPR 

DTQALLSATQAMDLRRRDYHMERPLLNQEH 

LEELGRWGSAPRTHQWRTWLQCSRARAYAL 

LLQHLPVLVWLPRYPVRDWLLGDLLSGLSVA 

IMQLPQGLAYALLAGLPPVFGLYSSFYPVFIY 

FLFGTSRHISVESLCVPGPVDT 


P! 1 

71 J 




A 


7RQ0 




OL/O 


ppriT^p <?<sPQ\/f appii m *2T?n'PTA q\/p\/#t pppi/7 
LL,r\j i oi\-ijOi\oivi^\jjjvi-rVjjL>or vji-<ir\.ij v CjiyiL/tr cnvj 

SCRPKARSSSARWALTCCLVLLPFLAGLTTYL 

LVSQLRAQGEACVQFQALKGQEFAPSHQQV 

YAPLRADGDKPRAHLTVVRQ'IP'IQHFKNQFP 

ALHWEHELGLAFTKNRMNYTNKFLLIPESGD 

YFIYSQVTFRGMTSECSEIRQAGRPNKPDSITV 

VTTKVTDSYPFPTOI T MGTKSVrFVfiWWFO 

PIYLGAMFSLQEGDKXMVNVSDISLVDYTKE 

DKTFFGAFLL 


912 


2262 


A 


7891 


1263 


111 


ACG1RHEGALPGLTATPEAMLRFLPDLAFSFL 
LILALGQAVQFQEYVFLQFLGLDKAPSPQKFQ 
PVPYILKiaFQDREAAATTGVSRDLC^KELG 
VRGNVLRFLPDQGFFLYPKK1SQASSCLQKLL 
YFNLS A IKEREQLTL AQLGLDLGPNS YYNLGP 
ELELALFLVQEPHVWGQ'ITPKPGKMFVLRSV 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteme, 

T^v A rnorti n A nir\ P — OllltHmtC Acid 

F=Phenylalanine, G=Glycine, H=Histidine, 
I=jsoIeucine, K=Lysinc, L=Leucinc, 
M=Methionine, N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine 3 W=Tryptophan, 
Y=Tyrosine, X^Unknown, *=Stop codon, 
/ — r»r»c<:ih1e nucleotide deletion \— DOSSlble 
nucleotide insertion 














PWPQGAVHFNLLDVAKDWNDNPRKNFGLFL 

E1LVKEDRDSGVNFQPEDTCARLRCSLHASLL 

WTLNPDQCHPSRKKKAAlPVPiCLSCKNLCH 

RHQLFINFRDLGWHKWIIAPKGFMANYCHGE 

CPFSLTISLNSSNYAFMQALMIIAVDPEIPQAV 

CIPTKLSPISMLYQDNNDNVILRHYEDMVVD 

ECGCG 


913 


2263 


A 


7892 


15 


849 


a cut ~dt> r'orirn, apvmi? PI T HT 1 1 V F AGCTF AL 

YLLSTRLPRGRRLGSTEEAGGRSLWFPSDLAE 

r rft «5PVT RFYRKFHOAYVFLLFCGAYLYKO 

GFAIPGSSFLNVLAGALFGPWLGLLLCCVLTS 

VGATCCYLLSSIFGKQLWSYFPDKVALLQR 

K\nEENRNSLFFFLLFLRJLFPMTPNWFLNLSAPl 

LNIPIVQFFFSVLIGL1PYNFICVQTGS1LSTLTS 

LDALFSWDTVFKLLAIAMVALIPGTLIKKFSQ 

KHLQLNETSTANHIHSRKDT 


914 


2264 


A 


7893 


815 


959 


vcr,w\;wwi TPT TPAT WFAOTFCrSI.RPFVKN 
RLSNITRPFFSKKKKILV 


915 


2265 


A 


7909 


3 


641 


HASGPGGLLRRRRGSGANMPVARSWVCRKT 

YVTPRRPFEKSRLDQELKL1GEYGLRNKREV 

WRVKFTLAKIRKAARELLTLDEKDPRRLFEG 

NALLRRLVRIGVLDEGKMKLDYILGLKIEDFL 

ERRLQTQVFKLGLAKSIHHAHVLIQQCHIRVR 

EQVVNILFFTVRLDSQKHIDFSLCFPIGVANPS 

HV1CRKNASKGQGGAGARDDEEEE 


916 


2266 


A 


7914 


3 


967 


VAHTQWHTCQRLSQLTHRSILKYLLIDTHAC 
QVLILKHTIIASLSLPSCQECFPSSIPSASHMVS 

SILPTGDSWGMLACLCTVLWHLPAVPALNRT 

GDPGPGPSIQKTYDLTRYLEHQLRSLAGTYLN 

YLGPPFNEPDFNPPRLGAETLPRATVDLEVW 

RSLNDKLRLTQNYEAYSHLLCYLRGLNRQAA 

TAELRRSLAHFCTSLQGLLGSIAGVMAALGY 

PLPQPLPGTEPTWTPGPAHSDFLQKMDDFWL 

LK^LQTWLWRSAKX^FNRLKKJ^QPPAAAVT 

LHLGAHGF 


917 


2267 


A 


7921 


2 


1166 


RPRRGQGLVQEVQTENVTVAEGGVAEITCRL 

HQYDGSIVV1QNPARQTLFFNGTRALKDERFQ 

LEEFSPRRVRIRLSDARLEDEGGYFCQLYTED 

THHQIATLTVLVAPENPWEVREQAVEGGEV 

Pi cfT VPP9PPAATT T? WYRDPJCELKGVSSSO 

ENGKV WS V AST VR_FR VDRKDDGGIIICEAQN 

QALPSGHSKQTQYVLDVQYSPTARIHASQAV 

VREGDTLVLTCAVTGNPRPNQIRWNRGNESL 

PERAEAVGETLTLPGLYSADNGTYTCEASNK 

HGHARALYVLVVYGESRLRPTEGGGGAPDP 

GAVVEAQTSVPYAIVGGILALLVFLIICVLVG 

MVWCSVRQKGSYLTHEASGLDEQGEAREAF 

LNGSDGHKRKEEFFI 


918 


2268 


A 


7938 


3 


2653 


"RRRLPPASPPSSSVSSSLSPSAVVMACRWSTK 
ESPRWRSALLLLFLAGVYGNGALAEHSENVH 
ISGVSTACGETPEQIRAPSGIITSPGWPSEYPAK 
IMCSWFIRANPGEUTISFQDFDIQGSRRCNLD 
WLTIETYKNIESYRACGSTIPPPYISSQDH1WIR 
FHSDDNISRKGFRLAYFSGKSEEPNCACDQFR 
CGNGKC1PEAWKCNNMDECGDRSDEEICAICE 
AWPTAAAFQPCAYNQFQCLSRFTKVYTCLP 
ESLKCDGNIDCLDLGDEIDCDVPTCGQWLKY 
FYGTFNSPNYPDFYPPGSNCTWL1DTGDHRK 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

nucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 

F=Phcnylalanine, G=Glycine, H=Histidine, 
I=4soleucine, K=Lysine s L=Leucine s 
M=Methionine, N=Asparagine, P=Proline, 
Q=GJiitamine, R=Arginine, S=Serine. 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine ; X-Unknown s *^Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














vilrptdfkldgtgygdyvkjydgleenphk 

llrvl/tafdshafltwsssgqirvhfcadkv 

naargfnatyqvdgfclpweipcggnwgcy 

teqqrcdgywhcpngrdetnctmcqkeefp 

csrngvcyprsdrcnyqnhcpngsdekncff 

cqpgnfhcknnrcvfeswvcdsqddcgdgs 

deencpvivptrvitaavigslicglllvialg 

ctcklyslrmferrsfetqlsr\t:aellrrea 

ppsygqliaqglippvedfpvcspnqasvlenl 

TCT AVR^OI OPT^VTM PM AfXD QC:MI\XrNTPTP'MPA 
Y IVOV<JL<VjI 1 O V IvJLI lVlr\vJl\.oO!Nl W IN tvl 1 1N1 f\ 

RSRHSGSLALVSADGDEWPSQSTSREPERNH 

i niVijLir o v i_ o vjvj i u i £_< l \ r> I v rv vJ i VI r\ vj t\ o k_J m v r\ r\ 

PLPQKVPPTTAVEATVGACASSSTQSTRGGH 

ADNGRDVTSVEPPSVSPARHQLTSALSRMTQ 

GLRWVRFTLGRSSSLSQNQSPLRQLDNGVSG 

REDDDDVEMLIP1SDGSSDFDVNDCSRPLLDL 

ASDQGQGLRQPYNATNPGVRPSNRDGPCERC 

GTVHTAOTPDTn FVT1 TvNFTSDDFAI T I C 


919 


2269 


A 


7951 


1674 


1839 


V VRVTCCPP ARSTTERTO AYDEEDC VEM V AS 
GGWNDVACHTTM^TMCEFDKKNM 


920 


2270 




7953 


47 




GLRCLPHLPAICARRMSPAFRAMDVEPRAKG 
VLLEPFVHQVGGHSCVLRFNETTLCKPLVPRE 
HOFYFTT PAFMRKFTPOVKGICSOT T FflT PT-TW 

RGDVRDRGHGRPVVQPSLEPSLPPTLCFPSLSS 
FSSSWPSAQHLTPSVFNPW 


921 


2271 


A 


7957 


612 


812 


RSGRTVVTGIGYSKALQSSNRNTKSLLQNEF 

MMVYSFRALSFKESTWATFQHGGEATKSRSL 

SSTQ 


922 


2272 


A 


7967 


1443 


1660 


EN1TEKWKEIWMCRGNKKSCCWTFIKDRHLT 
VSCCKSKSGETLLICIFCSNLVGFFFFGIRGFSN 
WELVKPN 


923 


2273 


A 


7981 


1 


3023 


GSAPRAATA1VLARARPPPPPSPPPGLLPLLPPLL 

LLPLLLLPAGCRAI.EETLMDTKWVTSELAWT 

SHPESGWEEVSGYDEAMNPIRTYQVCNVlviSS 

SQNNWLRTGFIWRRDVQRVYVELKFTVRDC 

NSIPNIPGSCKETFNLFYYEADSDVASASSPFW 

MENPYVKVDTIAPDESFSRLDAGRVNTKVRS 

FGPLSKAGFYLAFQDQGACMSLISVRAFYKX 

CASTTAGFA1.FPETITGAEPTSLVIAPGTCIPN 

AVEVSVPLKXYCNGDGEWMVPVGACTCATG 

HEPAAKESQCRPCPPGSYICAKQGEGPCLPCPP 

NSRTTSPAASICTCHNNFYRADSDSADSACTT 

VPSPPRGVISNVNETSLILEWSEPRDLGVRDD 

LLYNVICKKCHGAGGASACSRCDDNVEFVPR 

QLGLSEPRVHTSHLLAHTRYTFEVQAVNGVS 

GKSPLPPRYAAVNITTNQAAPSEVPTLRLHSS 

SGSSLTLSWAPPERPNGVILDYEIVIKYFEKSEG 

MSTVTSQMNSVQLDGLRPDARYWQVRART 

VAGYGQYSRPAEFETTSERGSGAQQLQEQLP 

LIVGSATAGLVFWAVVVIAIVCLRKQRHGS 

DSEYTEKLQQYIAPGMKVYIDPFTYEDPNEA 

VREFAKEIDVSCVKIEEVIGAGEFGEVCRGRL 

KQPGRREVFVAIKTLKVGYTERQRRDFLSEA 

SIMGQFDHPNIIRLEGVVTKSRPVMILTEFME 

NCALDSFLRLNDGQFTV1QLVGMLRGIAAGM 

KYLSEMN Y VHRDL AARNTL VNSNL VCK VSDF 

GLSRFLEDDPSDPTYTSSLGGKIPIRWTAPEAI 

AYRFCFTSASDVWSYGIVMWEVMSYGERPY 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 

i-v a _ — l.z ~ \ *\ X2 — /"iliif ivmif* A r* i ri 
JL/ = A.SparTlC - A \C1Q ; , E, — vJlUlalliiw- n.wu, 

F=PhcnyIalaninc, G=Glycinc : H=Histidinc, 
I=lsoleucine, K=Lysine, L=Leucine 5 
M=Methionine, N=Asparagine, P=ProIine, 
Q=Gtutamine, R=Arginine, S=Serine, 
TWhreonine, V=Valine, W=Tryp t0 P han > 

V — r T\rtr\cir\F> Y=T [nl^nnwn *=Stor> codon. 

/possible nucleotide deletion, \=possible 
nucleotide insertion 














WDMSNQDVmAVEQDYRLPPPMDCPTALHQ 
LMLDCWVRDRNLRPKFSQIVNTLDKLIRNAA 
SLKVIASAQSGMSQPLLDRTVPDYTTFTTVGD 
wt n ATfcTMHR YTCF^FVSAGFASFDLVAOMTA 
EDLLR1GVTLAGHQKK1LSSIQDMRLQMNQT 
i pvnv 


924 


2274 


A 


7985 


1 


503 


FRPRTKKATAMYLEHYLDSIENLPCELQRNf 

QLMRELDQRTEDKKAEIDILAAEYISTVKTLS 

PDQRVERJLQKIQNAYSKCKEYSDDKVQLAM 

otvf MVnfcTT-f TR R 1 DADT ARFEADLKDKMEG 

SDFESSGGRGLKKGRGQKEKRGSRGRGRRTS 

EEDTPKKKKHKGG 


925 


2275 


A 


7994 


447 


589 


LPCSFCAQCMSSFERVWLQQSHFHNPRWNSR 


926 


2276 


A 


7996 


925 


582 


GPCKVCCITLAIMLQCHSFYRKDVQVEHPKS 
LNPKYSQIENFLSADMALKRKCLLSISDLDFW 
IWDAQPVGIMQTLQNLKKIPNPGCFWSQAFQI 
RDTQPILPLGGRYYITIRQ 


927 


2277 


A 


7998 


2 


353 


DihDDr ■N.TCDQTilsjTJQT FVlf AFT TAKOATMKLSV 

CLLLVTLALCCYQANAEFCPALVSELLDFFFI 
SEPLFKLSLAKFDAPPEAVAAKLGVKRCTDQ 
MSLQKRSLIAEVLVK1LKKCSV 


928 


2278 


A 


8004 


130 


588 


LAPLRCQPGTRTQPRSHPAANDPSAAMSAAG 
ARGLRATYHRLLDKVELlslLPEKLRPLYNHPA 
GPRTWFWAPIMKWGLVCAGLADMARPAEK 
LSTAQSAVLMATGFIWSRYSLVIIPKNWSLFA 
VNFFVGAAGASQLFRIWRYNQELICAKAHK 


929 


2279 


A 


8007 


2 


1016 


EFARRRVFIAAREMSLLRSLRVFLVARTGSYP 

AGSLLRQSPQPRHTFYAGPRLSASASSKELLM 

Ki,RKJvl \j X or ViNL/rviS^JuC l ^vj^-L/i-rrv^/ vc-i w a_, 

HIOEAQKEGWSKAAKLQGRKTKEGLIGLLQE 

GNTTVLVEVNCETDFVSRNLKFQLLVQQVAL 

GTMMHCQTLKDQPSAYSKGFLNSSELSGLPA 

GPDREGSLKDQLAI.AIGKLGENMILKRAAWV 

KVPSGFYVGSYVHGAMQSPSLHKLVLGKYG 

ALVICETSEQKTNLEDVGRRLGQHVVGMAPL 

SVGSLDDEPGGEAETKMLSQPYLLDPSITLGQ 

YVQPQGVSWDFVRFECGEGEEAAETE 


930 


2280 


A 


8008 


3 


1679 


NSRVWGPWTEPSAGSLRPMARKQNRNSKEL 

GLVPLTDDTSHAGPPGPGRALLECDHLRSGV 

PGGRRRKDWSCSLLVASLAGAFGSSFLYGYN 

LSVVNAPTPYIKAFYNESWERRHGRPIDPDTL 

TLLWSVTVSIFAIGGLVGTLIVKMIGKVLGRK 

HTLLAKNGFAISAALLMACSLQAGAFEMLIV 

GRFTMGIDGG VALS VLPMYLSEISPKEIRG SLG 

QVTA1FICIGVFTGQLLGLPELLGKESTWPYLF 

GVIWPAWQLLSLPFLPDSPRYLLLEKHNEA 

T) a \ /IT A PHTFT CtK ADV^OFVFFVI AFSRVORS 

IRLVSVXELLRAPYVRWQVVTVIVTMACYQL 

CGLNAIWFYTNSIFGKAGIPPAKIPYVTLSTGG 

IETLAAVFSGLV1EHLGRRPLUGGFGLMGLFF 

GTLTITLTLQDHAPWVPYLSIVGILAIIASFCSG 

PGGIPFILTGEFFQQSQRPAAFIIAGTVNWLSN 

FAVGLLFPFIQKSLDTYCFLVFATICITGAIYL . 

YFVLPETKNRTYAEISQAFSKRNKAYPPEEKI 

DSAVTDGKINGRP 


931 


2281 


A 


8009 


861 


300 


AAGAVVS AN4PKAKGKTRRQKFG Y S VNRKRL 
NRNARRKAAPRIECSHIRHAWDHAKSVRQNL 
AEMGLAVDPNRAWLRKIUCVKAMEVDIEER 
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SEQ ID 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

NO- nf 

peptide 
seq- 
uence 


Met 

iiuu 


SEQ 
m mo* 

in 

USSN 
09/496 
914 


Predicted 

Beginning 

nucleotide 

location 

correspondi 

no tr» ft ret 

amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 

dUlU J CdluUC 

of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
u— a. s panic /vcia, t, — oiutamic Acia, 
F=Phenylalanine, G=Glycine, II=Histidine, 
Msoleucine. K-Lysinc, L^Leucine, 
M=Methionine, N=Asparagine, P=Proline, 

1 » lt"0 rir\ 1 nA T? = A ml nin o O — Coring 

V — uiuitiiiiiiic, iv — rvrginine, o — oenne, 
T=Threonine, V=Valine, W=Tryptophan, 
Y^Tyrosine, X^Unknown, *=Stop codon, 
/possible nucleotide deletion, V=possible 
nucleotide insertion 














L1DY VRYM VENHGEDYKAMARDEKNYYQD 
TPKQIRSKINVYICRJFYPAEWQDFLDSLQKRK 
MEVE 


932 


2282 


A 


son 


412 


1 


SNLCLGN S WR WR W AK SRHHC1PT VTLSKRS G 

DIRGSHFSSPQRQRSQRVPGKETARVLRAGK 

QGRGQIP1PCPWPPPPPPPPPGSPGPGCRQFHQ 

SLEAKAIIHPASVREMRGKVKMRRALRRAPA 

STRASSRQPNPK 


933 


2283 


A 


8012 


147 


1077 


PPVPPASRSDMAQNLKDLAGRLPAGPRGMGT 

A LrLL L 1. .O AO A V A Y O V Kt5> V F T VEG GHRAIF F 

NRJGGVQQDTILAEGLHFRIPWFQYPIIYDIRA 

RPRKISSPTGSKJ2LQMVNISLRVLSRPNAQEL 

PSMYQRLGLDYEERVLPSIViraVLKSVVAKF 

NASQLITQRAQVSLLIRRELTERAKJDFSLILDD 

VAITELSFSREYTAAVEAKQVAQQEAQRAQF 

LVEKAKQEQRQKIVQAEGEAEAAKMLGEAL 

SKNPGYlKLRKlRAAQNISKTIATSQNRrYLTA 

DriL, VLiNJLl^UJbbr 1 KOoJJoLlKGKK 


934 


2284 


A 


8023 


255. 


982 


SQFSLSQVLVDSAEEGSLAAAAELAAQKREQ 

RLRKFRELHLMRNEARKLNHQEVVEEDKRL 

KLr AlN WtAisAAKLb W bLI^hbKKJUiC AARC t 

EDYEKVKXLEISAEDAERWERKJCKRKNPDLG 

FSDYAAAQLRQYHRLTKQIKPDMETYERLRE 

KHGEEFFPTSNSLLHGTHVPSTEEIDRMVIDEE 

KQIEKRDKYSRRRPYNDDADIDYINERNAKF 

NKKAERFYGKYTAEIKQNLERGTAV 


935 


2285 


A 


8027 


59 


310 


L V S ST VN LLTEKAP WNSL A WTVTS Y VFLKJFL 

nnnr.THCTr.N/fD r^o a r ~rr j njt~" dcltd uoi crar 
^uuu 1 Ob I OMKJJbAJL 1 JLIAjlOr'bnKJ^oLblKJL 

oynoor r\r ivj i ov^ i .r MIL V l^O 


936 


2286 


A 


8032 


1 


639 


SGRECNMAICTYDYLFKLLLIGDSGVGKTCVL 

FRFSEDAPNSTFISTIGIDFKIRTIELDGKRIKLQ 

IWDTAGQERFRTITTAYYRGAMGIMLVYDIT 

NEKSFDNIRNWIRNIEEHASADVEKA1ILGNKC 

DVNDKRQVSKERGEKLALDYGIKFMETSAX 

ANINVENAFFTLARDIKAKMDKKLEGNSPQG 

^>jnnvi«rTTPT^nni<rpQQPr , T>p\/r t 

OINV^VJ V Ivl 1 1 i^v<V^JSJLS.Oorl 1\XsVLj1-> 


937 


2287 


A 


8039 


393 


311 


EETIHSENSYILEKYIPISANLTLTIA 


938 


2288 


A 


8052 


675 


•1334 


LHPAATSTAWLHVPPGLSMALSWVLTVLSLL 

PLLEAQIPLCANLVPVPITNATLDRITGKWFYI 

ASAFRNEEYNKSVQEJQATFFYFTPNKTEDTIF 

LREYQTRQDQCIYNTTYLNVQRENGTISRYV 

GGQEHFAHLLILRDTKTYMLAFDVNDEKNW 

GLS V YADKPETTKEQI .GEFYEALDCLRIPKSD 

VVYTDWKKDKCEPLEKQHEKERKQEEGES 


939 


2289 


A 


8055 


12 


1039 


SSVAEFPERVQLSQPQNWNFSGAGGAWSLDF 
AEQLK WS AEL ARLGESIMDGKQGGMDG SKP 
AGPRDFPGIRLLSNPLMGDAVSDWSPMHEAA 

Tlir'LIAT CI T) "KIT TOAPM/ A ^/\IITT A r\U\ ^CTiT T TT7 A 

lHOHQLbLKNLlbQOWAVNin ADH V bPLHEA 

CLGGHLSCVKILLKHGAQVNGX'TADWHTPL 

FNACVSGSWDCVNI I T OHGA^VOPF^DT A<;P 

1HE.AARRGHVECVNSLIAYGGNIDHKISHLGT 

PLYLACENQQRACVKKLLESGADVNQGKGQ 

DSPLHAVARTASEELACLLMDFGADTQAKN 

AEGKRP\^ELVPPESPLAQLF2.EREGPPSLMQL 

CRUURKCFGIQQHHKITKLVLPEDLKQFLLH 

L 


940 


2290 


A 


8058 


2 


1203 


KVLSIREPAHSTARIvASEPSQPSQPSQPGGHLI 
ARLRTMDLHLFDYSEPGNFSDISWPCNSSDC1 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A-AIanine C Cysteine, 
JL>— Aspartic Acid, ijiuiarniL /vuiu, 
F=Phenylalanine, G=Glycine, H=Histidine, 
Wsoleucine, K^Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine s W=Tryptophan, 
Y=Tyrosine, X^Unknown, *^Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














VVDTVMCPNMPNKSVLLYTLSFIYIFIFVIGMI 

ANSVVVWVN1QAKTTGYDTHCY1LNLAIADL 

unnn Tn)\^ww?i VOIJ^OWPMOELTCKVTH 

LIFSINLFGSIFFLTCMSVDRYLSITYFTNTPSS 

RKKMVRRWCILVWLLAFCVSLPDTYYLKT 

VTSASNNETYCRSFYPEHSIKEWLIGMELVSV 

\n ri7AVPP"5nAVF\TT 1 ARAISASSDOEKHSS 

RK1IFSYVWFLVCWLPYHVAVLLDIFSILHYI 

PFTCRLEHALFTALHVTQCLSLVHCCVNPVL 

YSFINRNYRYELMKAFIFKYSAKTGLTKLIDA 

SRVSETEYSALEQSTK 


941 


2291 


A 


8059 


73 


432 


"DMAGLMTnrrSLLFLGVCAHHllPTGSWLPS 
PCCMFFVSKR1PENRVVSYQLSSRSTCLKAGV 
IFTTKKGQQFCGDPKQEWVQRYMKNLDAKQ 
KKASPRARAVAVKGPVQRYPGNQTTC 


942 


2292 


A 


8067 


278 


1262 


GGIGEIKQRPSCLGRCLDPSLSVLMNISLGLGS 
VFSAVISQKPSRDICQRGTSLTIQCQVDSQVT 

. , x , r «;vn nnurnQT Ti l A T A MfY^P ATYF^frF 
MiViF W YKQQF^QcvLi L.1AI AlNv^ooJOV i i nour 

VIDKFPISRPNLTFSTLTVSNMSPEDSSIYLCSA 

GRQGTYEQYFGPGTRLTVTEDLKNVFPPEVA 

VFEPSEAE1SHTQKATLVCLATGFYPDHVELS 

WWVNGKEVHSGVSTDPQPLKEQPALNDSRY 

CLSSRLRVSATFWQNPRNHFRCQVQFYGLSE 

NDEWTQDRAKPVTQIVSAEAWGRADCGFTS 

ESYQQGVLSATILYEILLGKATLYAVLVSALV 

LM AM VKRKD SRG 


943 


2293 


A 


8070 


1 


879 


MVKVVPATRGNLPRSQLTGTHQHCQPREPKI 

<tp a nnni t> T> "D dd ATARI I? A 14 A APPPPPT AVFAP 

PSDRKELLALPVACDPVIASVMSWVQAASLI 

QGPGDKGDVFDEEADESLLAQREWQSNMQR 

RVKEGYRDG1DAGKAVTLQQGFNQGYKKGA 

EVILNYGRLRGTLSALLSWCHLHNNNSTLINK 

INNLLDAVGQCEEYVLKHLKSITPPSHVVDLL 

DSIEDMDLCHVVPAEKKIDEAKDERLCENNA 

EFNKNCSKSHSGIDCSYVECCRTQEHAHSGK 

PKPHMDFGTDSQF 


944 


2294 


A 


8073 


1 


797 


MAATSGTDEPVSGELVSVAHALSLPAESYGN 
DPDIEMAWAMRAMQHAEVYYKLISSVDPQF 
LKLTKVDDQI Y SEFRKNFETLR1DVLDPEELK 
SESAKEKWPJPFCLBCFNGIVEDFNYGTLLRLD 
CSQGYTEENTIFAPR1QFFAIEIARNREGYNKA 
\ / v i c\/ nnif P nv K fl VMN n GF K R A D S GEEENT 
KNGGEKGADSGEEKEEGINREDKTDKGGEK 
GKEADICE1NKSGEKAM 


945 


2295 


A 


8074 


2 


505 


GAATLLRSASSAAJUCAAEAEQVWLHLHRYL 

SADRRVLGLREWGRPASERECSLCQRLKREL 

NMGDVEKGKKIFIMKCSQCHTVEKGGKHKT 

r-«T>xTT i-ir;T FnPT^TnOAPrrYSYTAANKNlCGirVV 

GEDTLMEYLENPKKYIPGTKMIFVGIKKKEER 

ADL1AYLKKATNE 


946 


2296 


A 


8081 


42 


590 


" EGRRGKFGGKLCNFLFYFHSNSAESRMDVLF 
VAIFAVPLILGQEYEDEERLGEDEYYQWYY 
YTVTPSYDDFSADFTIDYSIFESEDRLNRLDK 
DITEAIETT1SLETARADHPKPVTYKPVTTEPQ 
SPRSEAMPCPVLRSPIPLPPVRVPLFRWGCISC 
KKVGRRLLMTLWMGVWQEE1GR 


947 


2297 


A 


8084 


322 


549 


" GGG S SPREL AG AAGLT VTSQ AV AARRQQPSF 
SRARAPAHSLRAALSLASSARSWGAVSRDRG 
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in 

USSN 
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914 
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beginning 

nucleotide 
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corresponds 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine C=Cysteine, 
D=A^nartic Acid Fssfilntpmir Ariri 
F~Phenvlalanine G=Glvcine H=Hi«;tidine 
I=lsolcucinc, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine 3 P=Proline, 
Q=GJutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y= Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possibie 
nucleotide insertion 














PCPPAIMYQSSNKC 


948 


2298 


B 


8093 


3905 


846 


MEPGE VKDRILENISL S VKKLQS Y FAACEDEl 

PAIRNHDKVLQRLCEHLDHALLYGLQDLSSG 

YWLVVHFTRREAIKQIEVLQHVATNLGRSR 

AWLYLALNENSLESYLRLFQENLGLLHKYYV 

KNALVCSHDHLTLFLTLVSGLEFIRFELDLDA 

PYLDLAPYMPDYYKPQYLLDFEDRLPSSVHG 

SDSLSLNSFNSVTSTNLEWDDSAIAPSSEDYD 

FGDVFPAVPSVPSTDWEDGDLTDTVSGPRST 

ASDLTSSKASTRSPTQRQNPFNEEPAETVSSS 

DTTPVHTTSQEKEEAQALDPPDACTELEVIRV 

TKJCKKIGKKKKSRSDEEASPLHPACSQKKCA 

KQGDGDSRNGSPSLGRJDSPDTMLASPQEEGE 

GPSSTTESSERSEPGLLIPEMICDTSMERLGQPL 

SKVIDQLNGQLDPST WCSRAEPPDQSFRTG SP 

GDAPERPPLCDFSEGLSAPMDFYRFTVESPST 

VTSGGGHHDPAGLGQPLHVPSSPEAAGQEEE 

GGGGEGQTPRPLEDTTREAQELEAQLSLVRE 

GPVSEPEPGTQEVLCQLKRDQPSPCLSSAEDS 

GVDEGQGSPSEMVHSSEFRVDNNHLLLLMIH 

VFRENEEQLFKMIRMSTGHMEGNLQLLYVLL 

TDCYVYLLRKGATEKPYLVEEAVSYNELDY 

VSVGLDQQTVKLVCTNRRKQFLLDTADVAL 

AEFFLASLKSAMIKGCREPPYPSILTDATMEK 

LALAKFVAQESKCEASAYTVRFYGLVHWED 

SYLGKEHWKTCFVVLSNGILYQYPDRTDVIP 

LLSVNMGGEQCGGCRRANTTDRPHAFQVILS 

DPPCLELSAESEAEMAEWMQHLCQAVSKGVI 

PQGVAPSPCIPCCLVLTDDRLFTCHEDCQTSF 

FRSLGT AKLGDIS A VSTEPG KE YC VLEFS QDS 

QQLLPPWVIYLSCTSELDRLLSALNSGWKTIY 

QVDLPHTAIQEASNKKKFEDALSUHSAWQR 

SDSLCRGRASRDPWC* 


949 


2299 


A 


8095 


9 


2374 


ARRADTVLLESPSMLQGLLPVSLLLSVAVSAI 

KELPGVKKYEVWPIRLHPLHKREAKEPEQQ 

EQFETELK YKMTTNGK1A VL YLKKNKNLL AP 

GYTETYYNSTGmTTSPQIMDDCYYQGHILN 

EKVSDASISTCRGLRGYFSQGDQRYFIEPLSPI 

HRDGQEILAXFKYNPDEKNYDSTCGMDGVL 

WAHDLQQNIALPATKLVKLKDRXVQEHEKY 

IEYYLVLDNGEFKRYNENQDEIRKRVFEMAN 

Y VNML YKKLNTH V AL VGMEI WTDKDKIKIT 

PNASFTLENFSKWRGSVLSRRKRHD1AQLITA 

TELAGTTVGLAFMSTMCSPYSVGVVQDHSD 

'NLLRVAGTMAHBMGHNFGMFHDDYSCKCPS 

TICVMDKALSFYIPTDFSSCSRLSYDKFFEDKL 

SNCLFNAPLPTDIISTPICGNQLVEMGEDCDC 

GTSEECTNICCDAKTCKIKATFQCALGECCEK 

CQFKKAGMVCRPAKDECDLPEMCNGKSGNC 

PDDRFQVNGFPCHHGKGHCLMGTCPTLQEQ 

CTELWGPGTE VADKSC YNRNEG G SK YG YCR 

RVDDTL1PCKANDTMCGKLFCQGGSDNLPW 

KGRIVTFLTCKTFDPEDTSQEIGiVfVANGTKCG 

DNKVCrNAECVDIEKAYKSTNCSSKCKGHAV 

CDHELQCQCEEGWIPPDCDDSSVVFHFSIVVG 

VLFPNIAN^FVWAMVIRHQSSREKQKKDQRP 

LSTTGTRPHKQKJUCPQMVKAVQPQEMSQMK 

PHVYDLPV^GNEPPASFHKDTNALPPTVFKD 

NPMSTPKDSNPKA 
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SEQ ID 

, NO: of * 
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eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alamne C=Cysteine, 

r\ A onortio AoiH TH^frliitamtc Acid 

U — ASpaniC /VUiU, c — uiuiaiiiiv' .rvw*j, 

F=Phenylalanine, G=Glycine, H=Histidine, 
Wsoleucine, K=Lysine, L=Leucine, 
M^Mcthionine, N-Asparagine, P=Proiine, 
Q=Glutamme, R=Arginine, S=Serine, 
T=Threonine 3 V=Valine, W=Tryptophan : 
Y=Tyrosine, X=Unknown. *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


950 


2300 


A 


8100 


1 


1251 


MGLLLMlLASAVLGSFLTLLAQFFLLYRRQPb 

PPADEAARAGEGFRYIKPVPGLLLREYLYGG 

GRDEKPSGAAPEGGATPTAAPETPAPPTRETC 

YFLNATILFLFRELRDTALTRRWTKK1KVEF 

EELLQTKTAGRLLEGLSLRDVFLGETVPFIKTI 

i?i \APP\/vn>^ATOFPr>nPFGEALPAACPEELAF 

EAEVEYNGGFHLAIDVDLVFGKSAYLFVKLS 

RVVGRLRLVFTRVPFTHWFFSFVEDPL1DFEV 

RSQFEGRPMPQLTSIIVNQLKKIIKRKHTLPNY 

KTRFKPFFPYQTLQGFEEDEEHIHIQQWALTE 

GRLK VTLLECSRLLIFG S YDREAN VHCTLELS 

SSVWEEKQRSSIKTGTISLTAVFMGWHRVSE 

AFPGLWYKLLVDLPFWGLEDGGPLLTVPLRQ 

CPG 


951 


2301 


A 


8108 


1612 


839 


tt\/ a t rrpcuAAPriwrVT FHVT DSIENLPFELOR 

NFQLMRDLDQRTEDLKAEIDKLATEYMSSAR 

SLSSEEKLALLKQIQEAYGKCKEFGDDKVQL 

AMQT YEMVDKHIRRLDTDLARFEADLKEK Ql 

ncc nvrjQ cccr ai: ir \r HftTOKF KKAARARSKG 

KNSDEEAPKTAQKKLKLVRTSPEYGMPSVTF 

GSVHPSDVLDMPVDPNEPTYCLCHQVSYGE 

MIGCDNPDCSIEWFHFACVGLTTKPRGKWFC 

PRCSQERKKK 


952 


2302 


A 


8112 


595 


291 


PSVASLARRFSGRALWPPSHSVPGNRALCPRL 
LHGTTLPGGNQRELARQKNMKJCQSDSVKGK 
RRDDGLSAAARKQRDSTPRDSEIMQQKQKK 


953 


2303 


A 


8118 


1 


669 


VCAGIRDPCSTPLAKPAAGGAENLSFGKQPG 

LETNILKMTTPNKTPPGADPKQLERTGTVREI 

GSQAVWSLSSCKPGFGVDQLRDDNLETYWQ 

SDGSQPHLVNIQFRRK1TVKTLC1YADYKSDE 

SYTPSKISVRVGNNFHNLQE1RQLELVEPSGW 

IHVPLTDNHKKPTRTFM1QIAVLANHQNGRD 

THMRQIKIYTP VEES SIGKFPRCTTIDFMM YRS 

TR 


954 


2304 


A 


8133 


66 


1015 


PPLPPRSFPNLFSRPEPLPEPGRRGCNRSREPA 

ada r»cT>T>t>DT7cri a pr,u A X/fVKVTF'MSALAOKE 

AKKDEPKSGEEALIIPPDAVAVDCKDPDDVV 

PVGQRRAWCWCMCFGLAFMLAGVILGGAY 

L YKYF ALQPDD V Y YCGIKYIKDD VILNEPS AD 

APAALYQTIEENIKIFEEEEVEFISVPVPEFADS 

DPANIVHDFNKKLTAYLDLNLDKCYV1PLNT 

SIVMPPRNLLELLINIKAGTYLPQSYLIHEHMV 

ITDRIENIDHLGFFIYRLCHDKETYKLQRRETI 

KGIQKRE ASNCF AIRHFENKF A VETLIC S 


955 


2305 


A 


8143 


35 


1171 


VESRSAWHEGEDQTDRLDFIRNQMNLLTLDV 

ICKKIKE VTEE V ANKV SCAMTDEICRLS VL VD 

EFCSEFHPNPDVLKIYKSELNKHIEDGMGRNL 

a t>t3 /~ ,r rrYC\/"M A T \n nTOOFTTPNT KPLLPAGIO 

DKLHTLIPCKICFDLSYNLNYHKLCSDFQEDIV 

FRFSLGWSSLVHRFLGPRNAQRVLLGESEPIF 

QLPRSLASTPTAPTTPATPDNASQEELMITLVT 

GLASVTSRTSMGIUVGGVrWKTlGWKLLSVS. 

LTMYGALYLYERLSWTTHAKERAFKQQFVN 

YATEKLRMIVSSTSANCSHQVKQQIATTFARL 

CQQVDITQKQLEEEIARLPKEIDQLEKIQNNS 

KLLRNKAVQLENELENFTKQFLPSSNEES 


956 


2306 


A 


8157 


1854 


798 


ASGSPAPSSSSAMAAACGPGAAGYCLLLGLH 

LFLLTAGPALGWNDPDRMLLRDVKALTLHY 
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NO: of 
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SEQ ID 
NO: of 
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seq- 
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hod 
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ID NO: 
in 
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Predicted 

L/^- i— i till n iag 

nucleotide 
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correbpondi 
ng to first 
amino acid 
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Predicted end 

JiUUICUlJUC- 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

i~J — SXit^jtll lit-- /itlUj C — V_7 J Ulciil J 1 U /\CIQ, 

F=Phenylalanine, G=Glycinc, H=Histidine, 
F=Isoleucine, K=Lysine, L=Leucine s 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
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Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion. \=possiblc 
nucleotide insertion 














DRYTTSRRLDPIPQLKCVGGTAGCDSYTPKVI 

VVSCEG YESSEDQYVLRGSCGLEYNLDYTEL 

GLQKLKESGKQHGFASFSDYYYKWSSADSC 

NMSGLITIVVLLGIAFWY1CLFLSDGQYSPPP 

YSEYPPFSHRYQRFTNSAGPPPPGFKSEFTGPQ 

NTGHGATSGFGSAFTGQQGYENSGPGFWTGL 

GTGG1LG YLFGSNRAATPFSDS W YYPS YPPS Y 

PGTWNRAYSPLHGGSGSYSVCSNSDTKTRTA 

SGYGGTRRR 


957 


2307 


A 


8159 


1492 


528 


THVVMTGMCYAPHQVLSYINGVTTSKPGVSI. 
VYSMPSRNLSLRLEGLQEKDSGPYSCSVNVQ 

niv fVTK"'£l?r"lI-TQIl/ T*I l^'l "MVI VPDADDQrDI r\r*\/ 
JUJx^iOiVoi\.OriolK. i L-CLiN V.L, V r r Ar r o L KLy u V 

PHVGANVTLSCQSPRSKPAVQYQWDRQLPSF 

QTFFAPALDVIRGSLSLTNLSSSMAGVYVCKA 

I-fNEVGTAQCNVTLEVSTGPGAAVVAGAVVG 

TLVGLGLLAGLVLLYHRRGKALEEPANDIKE 

DAIAPRTLPWPKSSDTISKNGTLSSVTSARAL 

RPPHGPPRPGALTPTPSLSSQALPSPRLPTTDG 

AHPQPISP1PGGVSSSGLSRMGAVPVMVPAQS 

QAGSLV ! 


958 


2308 


A 


8161 


2340 


1192 


ELARRPKQQSSEK SRNMIRN WLTIFILFPLKL V 

EKCESSVSLTVPPVVIOJ2NGSSTNVSLTLRPP 

LNATLVITFEITFRSKNITILELPDEVVVPPGVT 

Noor^ V 1 bt^N VUl^L 1 V YLHuNHSNQTGPRJR 

FLVIRSSAISITNQVIGWIYFVAWSISFYPQVIM 

NWRRKSVIGLSFDFVALNLTGFVAYSVFNIGL 

LWVPYIKEQFLLKYPNGVNPVNSNDYFFSLH 

AVVLTLI1IVQCCLYERGGQRVSWPAIGFLVL 

A WLFA F VTMI VA A V GV JT WLQFLFCFS YJKL 

AVTL VKYFPQ AYMNFY YKSTEG W S1GNVLL 

DFTGGSFSLLQMFLQSYNNDQWTLIFGDPTK 

FGLGVFSIVFDVVFFIQHFCLYRKRPGYDQLN 


959 


2309 




O 1 UJ 






fiCD A/lDDDflDl f~l\7Wi A /"YD^T}! T T>T> - D\7/~> C?T> r> T7 

UJbKAOKJsJvOiCLUV W AQrQrLLrKPVGbKlvb 

MQPPGPPPAYAPTNGDFTFVSSADAEDLSGSl 

ASPDVKLNLGGDFIKESTATTFLRQRGYGWL 

LEVEDDDPEDNKPLLEELDIDLKDIYYKIRCV 

LMPMPSLGFNRQVVRDNPDFWGPLAVVLFFS 

MISLYGQFRWSWIITIW1FGSLTIFLLARVLG 

GEVAYGQVXGVIGYSLLPUVIAPVLLVVGSF 

EVYSTLIICLFGVFWAAYSAASLLVGEEFKTK 

KPLLIYPIFLLYIYFLSLYTGV 


960 


2310 


A 


8167 


1 


2921 


MTCFKG QKGEQRSHAFEANKDHKAKVPSPN 

LYSQLNALQFTVDERSILWLNQFLLDLKQSL 

NQFMAVYKLNDNSKSDEHVDVRVDGLMLK 

FVIPSEVKSECHQDQPRAISIQSSEMIATNTRH 

CPNCRHSDLEALFQDFKDCDFFSKTYTSFPKS 

CDNFNLLHP1FQRHAHEQDTKMHEIYKGNITP 

QLNKNTLKTSAATDVWAVYFSQFWIDYEGM 

KSGKGRPISFVDSFPLSIWICQPTRYAESQKEP 

QTCNQVSLNTSQSESSDLAGRLKRKKLLKEY 

YSTESEPLTNGGQKPSSSDTFFRFSPSSSEADI 

HLLVHVHKHVSMQINm^QYLLLLFLHESLILL 

SENLRKDVEAVTGSPASQTSICIGELLRSAELA 

LLLHPVDQANTLKSPVSESVSPWPDYLPTEN 

GDFLSSICRKQISRDINRIRSVTVNHMSDNRSM 

SVDLSHIPLKDPLLFKSASDTNLQKGISFMDY 

LSDKHJ.GKISEDESSGLVYKSGSGEIGSETSD 

KKDSFYTDSSSVLNYREDSNILSFDSDGNQNI j 

LSSTLTSKGNETIESIFKAEDLLPEAASLSENL [ 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SKQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO : 
in 

USSN 
09/496 
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Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C-Cysteine, 
D=Aspartic Acid, E^Glutamic Acid, 
FHPhenylalanine, G=Glycine, H=Histidine, 
Msoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine 9 R=Arginine, S=Serine, 
T=Threonine, V=Valme, W=l ryptophan. 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possiblc 
nucleotide insertion 














DTSKEETPPVRTLKSQSSLSGKPKERCPPNLAP 

LCVSYKNMKRSSSQMSLDTISLDSMILEEQLL 

ESDGSDSHMFLEKGNKKNSTTNYRGTAESVN 

AGANLQNYGETSPDAISTNSEGAQENHDDLM 

SVVVFKITGVNGblDiKublJ 1 JblClAJVN v v i r 

DQLGNISLRHYLCNRPVGSDQKAVIHSKSSPE 

ISLRFESGPGAVIHSLLAEKNGFLQCHIENFST 

EFLTSSLMNIQHFLEDETVATVMPMKIQVSNT 

KINLKDDSPRSSTVSLEPAPVTVHIDHLVVER 

SDDGSFHIRDSHMLNTGNDLKENVKSDSVLL 

TSGKYDLKKQRSVTQATQTSPGVPWPSQSAN 

FPEFSFDFTREQLMEENESLKQELAKAKMAL 

AE AHL E KX> A L L HH IK Jvivl l Vb 


961 


2311 


A 


8172 


1442 


682 


TAAMSIFTPTNQ1RLTNVAVVRMKRAGKRFEI 

ACYKNKVVGWRSGVEKDLDEVLQTHSVFVN 

VSKGQVAKKEDLISAFGTDDQTEICKQILTKG 

EVQVSDKERH TQLtQNlrKXJlAllv/VJJis.v^ viNr 

ETKRPYTVILIERAMKDIHYSVKTNKSTKQQA 

LEVIKQL1<LEICV1KIERAHMRLRJF1LPVNEGKKL 

ICEKLKPLIKV1ESEDYGQQLEIVCLIDPGCFREI 

DELIKKETKGKGSLEVLNLKDVEEGDEKFE 


962 


2312 


A 


8175 


286 


587 


N1SNKAEVSSHPSVISHSMDSFGQPRPEDNQS 
VLRRMQKKYWKTKQVFIKATGKKEDEHLVA 
SDAELDAICLEVFHSVQETCTELLKIIEKYQLR 
LNGMKS 


963 


2313 


A 


8181 


13 


2215 


AEGCAERRGTEPVVELSMSWESGAGPGLGSQ 

GMDLV W S A W YGKCVKGKGSLPLSAHGIW 

AWLSRAEWDQYTVYLFCDDHKLQRYALNRJ 

TVWRSRSGNELPLAVASTADLIRCKL1..DVTG 

GLGTDELRLLYGMALVRFVNLISERlCrKFAK 

VPLKCLAQEVNIPDW1VDLRHELTHKKMPHI 

NDCRRGCYFVLDWLQKTYWCRQLENSLRET 

WELEEFREGIEEEDQEEDKNIWDDITEQKPE 

PQDDGKSTESDVKADGDSKGSEEVDSHCKK 

ALSHKELYERARELLVSYEEEQFTVLEKFRYL 

PKAIKAWNNPSPRVECVLAELKGVTCENREA 

VLDAFLDDGFLVPTFEQLAALQIEYEENVDL 

NDVLVPKPFSQFWQPLLRGLHSQNFTQALLE 

RMLSELPALGISGIRPTYILRWTVELIVANTKT 

GRNARRFSAGQWEARRGWRLFNCSASLDWP 

RMVESCLGSPC W ASPQLLRi 1 r KAMUQLrLi'U 

EEQEKLLR1CSIYTQSGENSLVQEGSEASPIGK 

SPYTLDSLYWSVKPASSSFGSEAKAQQQEEQ 

GSVNDVKEEEKEEKEVLPDQVEEEEENDDQE 

EEEEDEDDEDDEEEDRMEVGPFSTGQESPTA 

ENARLLAQKRGALQGSAWQVSSEDVRWDTF 

PLGRMPGQTEDPAELMI.ENYDTMYLLDQPV 

LEQRLEPSTCKTDTLGLSCGVGSGNCSNSSSS 

NFEGLLWSQGQLHGLKTGLQLF 


964 


2314 


A 


8184 


6 


1393 


EPRRNFRDDSTRPRTRGRTRGRRRRACRSAE 
GTGLRSLLLPPRLQLPAGPFSRCRWDPVSSPR 
PSTMPPKKGGDGIKPPPI1GRFGTSLKIGIVGLP 
NVGKSTFFNVLTNSQASAENFPFCTIDPNESR 
VPVPDERFDFLCQYHKPASKIPAFLNVVDIAG 
LVKGAHNGQGLGNAFLSHISACDGIFHLTRA 
FEDDDITHVEGSVDPIRDIEIIHEELQLKDEEMI 
GPIIDKLEKVAVRGGDKKLKPEYDIMCKVKS 
WVIDQKKPVRFYHDWNDKEIEVLNKHLFLTS 
KPMVYLVNLSEKDYIRKKNKWLIKIKEWVD 
| KYDPGALVIPFSGALELKLQELSAEERQKYLE 
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SEQ ID 

inU. 01 

nucl- 
eotide 
seq- 
uence 


SEQ ID 
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peptide 
seq- 
uence 


Met 
noo 


SEQ 
m "Kirv 

ILf IN 

in 

USSN 
09/496 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to Liibi 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=GIycine ? H=I listidine, 
J=Iso]eucine ; K=Lysine, L=Leucine. 
M=Methionine, N=Asparagine, P=Proline, 
\g= Lnuiarnine, k— /vrginine, o— serine, 
T=Threonine, V=Valine ? W=Tryptophan, 
Y=Tyrosine. XHJnknown. *=Stop codon, 
/=possible nucleotide deletion, \«possibie 
nucleotide insertion 














ANMTQSALPKIIKAGFAALQLEYFFTAGPDEV 
RAWTIRKGTKAPQAAGK1HTDFEKGF1MAEV 
MKYEDFKJEEGSENAVKAAGKYRQQGRNYIV 
EDGDIIFFICFNTPQQPKKK 




93 1 ^ 


A 


5 1 7J 


1/137 




Kbr bLor oLLbr bbMMALuAAG A l KVr VAMV 

AAALGGHPLLGVSATLNSVLNSNAIKNLPPPL 

GGAAGHPGSAVSAAPGILYPGGNKYQT1DNY 

QPYPCAEDEECGTDEYCASPTRGGDAGVQIC 

LACRKRRKRCMRHAMCCPGNYCKNGICVSS 

DQNHFRGEIEETITESFGNDHSTLDGYSRRTT 

LSSKMYHTKGQEGSVCLRSSDCASGLCCARH 

FWSKICKPXO.KEGQVCTKHRRKGSHGLEIFQ 

RCYCGEGLSCRIQKDHHQASNSSRLHTCQRH 


966 


2316 


A 


8207 


416 


4082 


KFKLIKIMLLTLIILLPVVSKFSFVSLSAPQHW 

SCPEGTLAGNGNSTCVGPAPFLIFSHGNSIFRI 

DTEGTNYEQLWDAGVSVIMDFHYNEKRIY 

WVDLERQLLQRVFLNGSRQERVCNIEKNVSG 

MAIN WINEE VI W SNQQEGI1TVTDMKGNNSHI 

LLSALKYPANVAVDPVERFIFWSSEVAGSLY 

RADLDGVGV1CALLETSEKITAV SLD VLDICRL 

FWIQYNREGSNSLICSCDYDGGSVHISKHPTQ 

HNLFAMSLFGDRIFYSTWKMKTIWIANKHTG 

KDMVRINLHSSFVPLGELKVVHPLAQPKAED 

DTWEPEQKLCKLRKGN C S ST VCG QDLQSHL C 

MCAEGYALSRDRKYCEGNDWKYCEDVNEC 

AFWNHGCTLGCKNTPGSYYCTCPVGFVLLPD 

GKRCHQLVSCPRNVSECSHDCVLTSEGPLCF 

CPEGSVLERDGKTCSGCSSPDNGGCSQLCVPL 

SP VS WECDCFPG YDL QLDEK S C AA S GPQPFL 

LFANSQDIRHMHFDGTDYGTLLSQQMGMVY 

ALDHDPX^ENKIYFAHTALKWIERANMDGSQ 

RERLIEEGVDVPEGLAVDWIGRRFYWTDRGK 

SLIGRSDLNGKRSfCIITIENfSQPRGIAVHPMAK 

RLFWTDTGINPRIESSSLQGLGRLV1ASSDLIW 

PSGITE)FLTDKLYWCDAKQSVIEMANLDGSK 

RRRLTQNDVGHPFAVAVFEDYVWFSDWAMP 

SVIRVNKRTGKDRVRLQGSMLKPSSLWVHP 

LAKPGADPCLYQNGGCEHICKKRLGTAWCS 

CREGFMKASDGKTCLALDGHQLLAGGEVDL 

KNQVTPLDILSKTRVSEDNITESQHMLVAEIM 

VSDQDDCAPVGCSMYARC1SEGEDATCQCLK 

GFAGDGKLCSDIDECEMGVPVCPPASSKCINT 

EGGYVCRCSEGYQGDGIHCLDIDECQLGVHS 

CGENASCTNTEGGYTCMCAGRLSEPGLICPD 

o I rrrl lLKL,UUrlH i £> VKNoiyoliCr^LbrlUG YCJL 

HDGVCMYIEALDKYACNCVVGYIGERCQYR 

DLKWWELRHAGHGQQQKVIVVAVCVVVLV 

MLLLLSLWGAHYYRTQKXLSKNPKNPYEESS 

RDVRSRRPADTEDGMSSCPQPWFVVIKEHQD 

LKNGGQPVAGEDGQAADGSMQPTSWRQEPQ 

LCGMGTEQGC WIPV S SDKGS CPQ VMERSFH j 

lvir o J U I y 1 l_rIZ»VJ vJ V GIVi rlOL,LjOr\rirL> VVv^V^rvrvL. 

DPPHQMELTQ 


967 


2317 


A 


8210 


3 


601 


SSAMGSRSSHAAVIPDGDS1RRETGFSQASLL 

RLHHRFRALDRNKKGYLSRMDLQQIGALAV | 

NPLGDRJIESFFPDG SQRVDFPGF VRVL AHFRP 

VEDEDTETQDPKKPEPLNSRRNKLHYAFQLY 

DLDRDGKISRHEMLQVLRLMVGVQVTEEQL 

ENIADRTVQEADEDGDGAVSFVEFTKSLEKM 

DVEHKMSIRILK 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C-Cysteine, 
D=Aspartic Acid, Jb— oiutamic acio, 
F=Phenylalanine, G=GIycine ? H=Histidine, 
I=lsoleucine, K=Lysine, L=Leucine, 
M^Methionine, N^Asparagtne, P=Proline ; 
Q=Glutamine, R=Arginine, S=Serine, 
1 — i nreonine, v — vanne, w — 1 1 Ypiujjucui; 
Y^Tyrosine, X=Unknown. * 5 =Stop codon, 
/^possible nucleotide deletion, \=possiblc 
nucleotide insertion 


968 


2318 


A 


8211 


2 


409 


I S S CPHT A YEG SMSTLSNFTQTLED VFRR1 F IT 

YMDNWRQNTTAEQEALQAKVDAENFYYV1L 

YLMVMIGMFSFIIVAILVSTVKSKRREHSNDP 

YHQYIVEDWQEKYKSQILNLEESKATIHENIG 

AAGFKMSP 


969 


2319 


A 


8215 


1 


1938 


GMPRSRGGRAAPGPPPPPPPPGQAPRWSRWR 

VPGRLLLLLLPALCCLPGAARAAAAAAGAGN 

RAAVAVAVARADEAEAPFAGQNWLKSYGY 

LLPYDSRASALHSAKALQSAVSTMQQFYGIP 

VTGVLDQTT1EWMKKPRCGVPDHPHLSRRRR 

NKRYALTGQKWRQKHITYSIHNYTPKVGELD 

TRKA1RQAFDVWQKVTPLTFEEVPYHEIKSDR 

KEADIMIFFASGFHGDSSPFDGEGGFLAHAYF 

PGPGIGGDTHFDSDEPWTLGNANHDGNDLFL 

VAVHELGHALGLEHSSDPSAIMAPFYQYMET 

HNFKLPQDDLQGIQKIYGPPAEPLEPTRPLPTL 

PVRRIHSPSERKHERQPRPPRPPLGDRPSTPGT 

KPNICDGNFNTVALr KCrbMr V r NUKWr w k.l 

RNNRVQEGYPMQ1EQFWKGLPARIDAAYER 

ADGRFVFFKGDKYWVFKEVTVEPGYPHSLG 

ELGSCLPREGIDTALRWEPVGKTYFFKGERY 

WRYSEERRATDPG YPKPIT V WKGIPQ APQG A 

FISKEGYYTYFYKGRDYWKFDNQKLSVEPGY 

PRNILRDWMGCNQKEVERRKERRLPQDDVD1 

MVTINDVPGSVNAVAWIPCILSLCILVLVYT1 

FQFKNKTGPQPVTYYKRPVQEWV 


970 


2320 


A 


8216 


1235 


2223 


SRLSLQFYVSFRRTGLFTCKLIVEIFFRNYMN 

DSLRTNVFVRFQPETIACACIYLAARALQIPLP 

TRPH WFLLFG 1 1 bfcJbl Qbl^l£, L l,kl t i Kjvjvr in 

YELLEKEVEKRKVALQEAKLKAKGLNPDGTP 

ALSTLGGFSPASKPSSPREVKAEEKSPISINVK 

TVKKEPEDRQQASKSPYNGVRKDSKRSRNSR 

SASRSRSRTRSRSRSHTPRRHYNNRRSRSGTY 

S SRSRSRSRSHSE SPRRHHNHG SPHLKAKHTR 

DDLKSSNRHGHKRKKSRSRSQSKSRDHSDAA 

KKHRHERGHHRDRRERSRSFERSHKSKHHGG 

SRSGHGRHRR 


971 


2321 


A 


8217 


3 


3274 


DCRLQAAMPTNFT V VP VEAHADG GGDETAE 

RTEAPGTPEGPEPERPSPGDGNPRENSPFLNN 

VEVEQESFFEGKNMALFEEEMDSNPMVSSLL 

NKXANYTNLSQGVVEHEEDEESRRREAKAPR 

MGTFIGVYLPCLQNILGVILFLRL.TWIVGVAG 

VLESFLIVAMCCTCTMLTAISMSAIATNGWP 

AGGSYYMISRSLGPEFGGAVGLCFYLGTTFA 

GAMYILGTIEIFLTYISPGAAIFQAEAAGGEAA 

AMLHNMRVYGTCTLVLMALVVFVGVKYVN 

KLALVFLACWLSILATYAGVIKSAFDPPDIPV, 

CLLGNRTLSRRSFDACVKAYGIHNNSATSAL 

WGLFCNGSQPSAACDEYFIQNNVTE1QG1PGA 

ASGVFLENLWSTYAHAGAFVEKKGVPSVPV 

AEESRASTLPYVLTDIAASFTLLVGIYFPSVTG 

IMAGSNRSGDLKDAQKSIPTGTILAIVTTSF1Y 

LSCrVLFGACTEG^.RDKFGEALQGNLVlGM 

LAWPSPWVIV1GSFFSTCGAGLQTLTGAPRLL 

QAIARDGIVPFLQVFGHGKANGEPTWALLLT 

VLICETGILIASLDSVAPILSMFFLMCYLFVNL 

ACAVQTLLRTPNWRPRFKFYHWTLSFLGMSL 

CLALMFICSWYY'ALSAMLIAGCTYKYIEYRG 

AEK :WGDGTRGLSLNAARYALLRVEHGPPHT 

KNWRPQVLVMLNLDAEQAMKHPliLLSFT'SQ 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beciimin fr 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A— Alanine C=Cysteine, 

D=A«;narlic AciH P=f~Tliif?imir; Arid 

F=Phenylalanine, G=Glycinc, H=Histidine, 
I=lso leucine, K=Lysine. L=Leucine, 
M=Melhionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threon ine, V= Valine, W^Tryptophan, 
Y=Tyrosine, X=Unknown. *=Stop codon, 
/^possible nucleotide deletion, \=possiblc 
nucleotide insertion 














LKAGKGLTIVGSVLEGTYLDKHMEAQRAEE 
N1RSLMSTEKTKGFCQLVVSSSLRDGMSHLIQ 
SAGLGGLKHNTVLMAWPASWKQEDNPFSW 
KNFVDTVRDTTAAHOAI .1 VAKNVD^FPOTsin 
ERFGGGHIDVWWINOiDGGMLMLLPFLLRQH 
KVWRKCRMRim^AQVT>DNSIQMKKDLQNlF 
L YH1 ,RIS AEVE VVEMVENDISAFTYERTLMM 
EQRSQMLKQMQLSKNEQEREAQLIHDRNTAS 
HTAAAARTOAPPTPDKVOMTWTRFKT TAFK" 

YRSRDTSLSGFKDLFSMKPDQSNVRRMI-ITAV 
KLNG WLNK SQDA QL VLLNMPGPPKNRQGD 
ENYMEFLEVLTEGLNRVLLVRGGGREVITIYS 


972 


2322 


A 


8224 


701 


246 


TSRRVTMKFNPFVTSDRSKNRKRHFNAPSHV 
RRKI M SSPL SKELRQKYN VRSMPIRKDDEVQ 
VVRGHYKGQQIGKWQVYRKKYVIYIERVQ 
REKANGTTVHVGIHPSKVVITRLKLDKDRKKI 
LERKAKSRQVGKEKGKYKEELIEKMQE 


973 


2323 


A 


8237 


873 


4610 


GCPHAGGKGRVPTGGLTGGRTWSPSAAPRSC 

PRPGPTPAPGAMDKLPPSMRKRLYSLPQQVG 

AKAWIMDEEEDAEEEGAGGRQDPSRRSIRLR 

PLPSPSPSAAAGGTESRSSALGAADSEGPARG 

AGKSSTNGDCRRFRGSLASLGSRGGOSGGTG 

SGSSHGHLHDSAEERRL1AEGDASPGEDRTPP 

GLAAEPERPGASAQPAASPPPPQQPPQPASAS 

CEQP S VDTAIKVEGGAAAGDQ1LPE AEVRLG 

QAGFMQRQFGAMLQPGVNKFSLRMFGSQKA 

VEREQERVKSAGFWIIHPYSDFRFYWDLTML 

LLMVGNLIIIPVGITFFKDENTTPWIVFNVVSD 

TFFLIDLVLNFRTGIVVEDNTEIILDPQRIKMK 

YLKS WFM VDFISS1P VD YIFLI VETR JDSEV YK 

TARALRIVRFTKILSLLRLLRLSRLIRYIHQWE 

EIFHMTYDLASAVVRIVNLIGMMLLLCHWDG 

CLQFLVPMLQDFPDDCWVSINNMVNNSWGK 

QYSYALFKAMSHMLCIGYGRQAPVGMSDV 

WLTMLSMIVGATCYAMFIGHATALIQSLDSS 

RRQYQEKYKQVEQYMSFHKLPPDTRQRIHD 

YYEHRYQGKMFDEESILGELSEPLREEITNFNC 

RKLVASMPLFANADPNFVTSMLTKLRFEVFQ 

PGDYIIREGTIGKKMYFIQHGVVSVLTKGNKE 

TKLADG S YFGEICLLTRGRRTAS VRADT YCR 

LYSLSVDNFNEVLEEYPMMRRAFETVALDRL 

DRIGKKNSILLHKVQHDLNSGVFNYQENEIIQ 

QIVQHDREMAHCAHRVQAAASATPTPTPVIW 

TPLIQAPLQAAAATTSVAIALTHHPRLPAAIFR 

PPPGSGLGNLGAGQTPRHLKRLQSLIPSALGS 

ASPASSPSQVDTPSSSSFHIQQLAGFSAPAGLS 

PLLPSSSSSPPPGACGSPSAPTPSAGVAATTIA 

GFGHFHKALGGSLSSSDSPLLTPLQPGARSPQ 

A AOP'sPAPPfJ ARfifil Hi PPM^l PPPPQVIPCPQC 

SPGQLGQPPGELSLGLATGPLSTPETPPRQPEP 

PSLVAGASGGASPVGFTPRGGLSPPGHSPGPP 

RTFPSAPPRASGSHGSLLLPPASSPPPPQVPQR 

RGTPPLTPGRLTQDLKLISASQPALPQDGAQT 

LRRASPHSSGESMAAFPLFPRAGGGSGGSGSS 

GGLGPPGRPYGA1PGQHVTLPRKTSSGSLPPP 

LSLFGARATSSGGPPLTAGPQREPGARPEPVR 

SKLPSNL 


974 


2324 


A 


8247 


279 


468 


EYKQWERRFLSCQNRNDLGYGKPRKGGGLL 
LVPVKDASRICSLTYLLGSHWNNLWRSPVL 

G ! 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid. E~Glutamic Acid, 
F=Phenylalanine, G=Glycinc, H=Histidine, 
I=IsoIeucine, K=Lysine. LHLeucine, 
M=Methionine, N=Asparagine, P=Protine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine : V=V aline, W= I ryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 


975 


2325 


A 


8249 


62 


1571 


LVALKNWKPKGTNIPAPQSPVFGEAVSGVYM 

MTKVLGMAPVLGPRPPQEQVGPLMVKVEEK 

EEKGKYLPSLEMFRQRFRQFGYHDTPGPREA 

LSQLRVLCCEWLRPEIHTKEQILELLVLEQFLT 

ILPQELQAWVQEHCPESAEEAVTLLEDLEREL 

DEPGHQVSTPPNEQKPVWEKISSSGTAKESPS 

SMQPQPLETSHKYESWGPLYIQESGEEQEFAQ 

DPRKVRDCRLSTQHEESADEQKGSEAEGLKG 

DIISVIIANKPEASLbKQUViNLbNbK-vj 1 JSJ J rLrV 

EAGSKKGRESVPTKPTPGERRYICAECGKAFS 

NSSNLTKHRRTHTGEKPYVCTKCGKAFSHSS 

NLTLHYRTHLVDRPYDCKCGKAFGQSSDLLK 

HQRMHTEEAPYQCKDCGKAFSGKGSLIRHYR 

IHTGEKPYQCNECGKSFSQHAGLSSHQRLHT 

GEKPYKCKECGKAFNHSSNFNKHHRIHTGEK 

PYWCI IHCGKTFCSKSNLSKFIQRVHTGEGEA 

P 


976 


2326 


A 


8257 


298 


7086 


GNMACWPQLRLLLWKNLTFRRRQTCQLLLE 

V A WPL FIPL ILI S VRLS YPP YEQHECHFPNKAM 

PSAGTLPWVQGIICNANNPCFRYPTPGEAPGV 

VGNFNKSIVARLFSDARRLLLYSQKDTSMKD 

MRKVLRTLQQIKKSSSNLKLQDFLVDNETFS 

GFLYHNLSLPKSTVDKMLRADVILHKVFLQG 

YQLHLTSLCNGSKSEEMIQLGDQEVSELCGLP 

REKL AAAERVLRSNMD1LKPILRTLN STSPFPS 

KELAEATKTLLHSLGTLAQELFSMRSWSDMR 

QEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEG 

GGLICIKSLNWYEDNNYKALFGGNGTEEDAE 

TFYDNSTTPYCNDLMKNLESSPLSRIIWKALK 

PLLVGKILYTPDTPATRQVMAEVNKTFQELA 

VFHDLEGMWEELSPKIWTFMENSQEMDLVR 

MLLDSRDNDHFWEQQLDGLDWTAQDIVAFL 

AKHPEDVQSSNGSVYTWREAFNETNQAIRTIS 

RFMECVNLNKLEPIATEVWLINKSMELLDER 

KFW AG I VFTGITPG SIELPHHVKYKIRMGIDN 

VERTNKIKDGYWDPGPRADPFEDMRYVWGG 

FAYLQDVVEQAIIRVLTGTEKKTGVYMQQMP 

YPCYVDDIFLRVMSRSMPLFMTLAWIYSVAV 

1IKG1VYEKEARLKETMRIMGLDNSILWFSWFI 

SSLIPLLVSAGLLVVILICLGNLLPYSDPSVVFV 

FLSVFAVVTILQCFLISTLFSRANLAAACGGn 

YFTLYLPYVLCVAWQDYVGFTLKIFASLLSP 

VAFGFGCEYFALFEEQG1GVQWDNLFESPVE 

EDGFNLTTSVSMMLFDTFLYGVMTWYIEAVF 

PGQYGIPRPWYFPCTKSYWFGEESDEKSHPGS 

NQKRISEICMEEEPTHLKLGVSIQNL\OCVYRD 

GMKVAVDGLALNFYEGQITSFLGHNGAGKT 

TTMSILTGLFPPTSGTAYII.GKDIRSEMSTIRQ 

NLG VCPQHNVLFDMLTVEEHI WFYARLKGLS 

EKHVKAEMEQMAIZ)VGLPSSKLKSKTSQLS 

GGMQR1CLSVALAFVGGSKVVILDEPTAGVDP 

Y^RRGIWEI I LKYROGRTIILSTHHMDEADVL 

GDRIAIISHGKLCCVGSSLFLKNQLGTGYYLT 

I.VKKDVESSLSSCRNSSSTVSYLKKEDSVSQS 

SSDAGLGSDHESDTLT1DVSAISNLIRKHVSEA 

RLVEDIGHELTYVLPYEAAKEGAFVELFHEID 

DRLSDLGISSYGISETTLEEIFLKVAEESGVDA 

ETSDGTLPARRNRRAF GDKQSCLRPFTEDDA 

ADPNDSC T)PESRETDLLSGMDGKGSYQVKG 

WKLTQQQFVALLWKRLLIARRSRKGFFAQIV 
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SEQ ID 
MO- of 
nucl- 
eotide 
seq- 
uence 


SHQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
hepinninp 

U/w£^J 1111 AUK, 

nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 

HUVIWllUv 

location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine OCysteine, 

F*=Phenylalanine, G=Glycine, H=Histidine, 
Msoleucine. K-Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=ProIine, 
Q=Glutamine, R=Argintne, S=Serine, 
T=Threonine, V=VaIine, W-Tryptophan, 
Y=Tyrosine, X=Unkno\vn s *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














LPAVFVCIALVFSLIVPPFGKYPSLELQPWMY 

NEQ YTFVSNDAPEDTGTLELLN ALTKJDPGFG 

TRCMEGNPIPDTPCQAGEEEWTTAPVPQTIM 

DLFQNGNWTMQNPSPACQCSSDKIKKMLPV 

CPPGAGGLPPPQRKQNTADILQDLTGRNISDY 

LVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVS 

NTQALPPSQEVTvJDATKQMKKHLKLAKDSSA 

DRFLNSLGRFMTGLDTRN^NVKVWFNNKGW 

HAISSFLNVINNA1LRANLQKGENPSHYGITAF 

NHPLNLTKQQLSEVAPMTTSVDVLVSICVIFA 

MSF VPASF VVFLIQERVSKAKHLQFI SG VKP VI 

YWLSNFVWDMCNYWPATLVIIIFICFQQKSY 

VSSTNLPVLALLLLLYGWSITPLMYPASFVFK 

IPSTAYVVLTSVNLFIG1NGSVATFVLELFTDN 

KLNNrNDILKSVFLIFPHFCLGRGLIDMVKNQ 

AMADALERFGENRFVSPLSWDLVGRNLFAM 

AVEGWFFLITVLIQYRFFIRPRPVNAKLSPLN 

DEDEDVRRERQRILDGGGQNDILEIKELTKIY 

RRKRKPAVDRJCVGIPPGECFGIJ..GVNGAGK 

SS'lTKMLTGDnvrRGDAFLNRNSILSNlHEV 

VPEKEVGKVGEWAIRKLGLVKYGEKYAGNY 
SGGNKRICLSTAMALIG GPP VVFLDEPTTGMD 
PKARRFLWNCALSVVKEGRSVVLTSHSMEEC 
EALCTRIMAIMVNGRFRCLGSVQHLKNRFGD 
GYTIVVRIAGSNPDLKPVQDFFGLAFPGSVPK 
EKHR^ILQYQLPSSLSSLAJRIFSILSQSKKRLH 
IEDYSVSQTTLDQVFVNFAKDQSDDDHLKDL 
SLIIKNQTWDVAVLTSFLQDEKVKESYV j 


977 


2327 


A 


8260 


3 


1567 


1PGSTISFSLCFIFPPCVPTMVRKPVVSTISKGG 

YLQGNVNGRJLPSLGNKEPPGQEKVQLKRKV 

TLLRGVSIIIGTJJGAGIFISPKGVLQNTGSVGM 

SLTIWTVCGVLSLFGALSYAELGTTIKKSGGH 

YTYILEVFGPLPAFVRVWVELLIIRPAATAVIS 

LAFGRYILEPFFIQCEIPELAIKLITAVGITWM 

VLNSMSVSWSARIQIFLTFCKLTAILIIIVPGV 

1V401 iK'nnTnMFR r nAP<snpn«;<:TTRi pi apvvt; 

lvlv^J_,irv\J^ 1 yiNrlVUArijUivlvool I lvl_,_r JUrvr I T \J 

MYAYAGWFYLNFVTEEVENPEKTIPLAIC1SM 

AIVTIGYVLTNVAYFTTINAEELLLSNAVAVT 

FSERLLGNFSLAVPIFVALSCFGSMNGGVFAV 

SRLFV'VASREGHLPEILSMIHVRKHTPLPAVIV 

LHPLTMIMLFSGDLDSLLNFLSFARWLFIGLA 

VAGLIYLRYKCPDMHRPFKVPLFIPALFSFTC 

LFMVALSLYSDPFSTG1GFVITLTGVPAYYLFII 

WDKKPRWFRIMSEKITRTLQIILEWPEEDKL 


978 


232S 


A 


8261 


2 


2165 


RGGSLRCVLGKLLGQLLCFQSERCVRFPEGLL 

RHRGCGLLSSRLSAGKPPLRTSFFGSWGVLPP 

LADAASMSGVRAVRIS1ESACEKQVHEVGLD 

GTETYLPPLSMSQNLARLAQRIDFSQGSGSEE 

EEAAGTEGDAQEWPGAGSSADQDDEEGWK 

FQPSLWPWDSVRNNLRSALTEMCVLYDVLSI 

VRDKKFMTLDPVSQDALPPKQNPQTLQLISK 

KKSLAGAAQILLKGAERLTKSVTENQENKLQ 

RDFNSELLRLRQHWKLRKVGDKILGDLSYRS 

AGSLFPHHGTFEV1KNTDLDLDKKIPEDYCPL 

DVQ1PSDLEGSAYIKVSIQKQAPDIGDLGTVN 

JLFKRPLPKSKPG SPH WQTKLEAAQNVLLCKEI 

FAQLSREAVQIKSQVPHIVVKNQIISQPFPSLQ 

LS1SLCHSSNDKKSQKFATEKQCPEDHLYVLE 

HNLHLUREFHKQTLSSIMMPHPASAPFGHKR 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
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Predicted end 
nucleotide 
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to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

H— A cr\rartif AriH P=OllltfltYlif Acid 

F=Phenylalanine, G=Glycine, H=Histidine 5 
I=lsoleucine, K^Lysine, L=-Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion. V=possible 

nnflentide insertion 














MRLSGPQAFDKNEINSLQSSEGLLEKIIKQAK 

HIFLRSRAAATIDSLASRIEDPQIQAHWSNIND 

VYESSVKVLITSQGYEQICKSIQLQLNIGVEQI 

RWHRDGRVITLSYQEQELQDFLLSQMSQHQ 

VHAVQQLAKVMGWQVLSFSNHVGLGPIESIG 

NASAITVASPSGDYAISVRNGPESGSKIMVQF 

PRNQCKDLPKSDVLQDNKWSHLRGPFKEVQ 

WNKJVTEGRNF V YK MELLM S ALSPCLL 


979 


2329 


A 


8289 


2 


1053 


FVWNPRGGRKRRRQAAVTQAATRASGTPSP 

RJDGTMTQGKLSVANKAPGTEGQQQVHGEKK 

EAPAVPSAPPSYEEATSGEGMKAGAFPPAPTA 

\/pr upqwa VVnP^^*? YnNfiFPTGDHFLFTT 

FSWDDQKVRRVFVRKVYTILLIQLLVTLAVV 

ALFTFCDPVKDYVQANPGWYWASYAVFFAT 

YLTLACCSGPRRHFPWNLILLTVFTLSMAYLT 

GMLSSYYNTTSVLLCLGITALVCLSVTVFSFQ 

TKFDFTSCQGVLFVLLMTLFFSGL1LAILLPFQ 

YVPWLHAVYAALGAGVFTLFLALDTQLLMG 

NRRHSLSPEE YIFG ALNIYLDIIYIFTFFI ,QLFG 

TNRE 


980 


2330 


A 


8305 


59 


857 


A cm pnVQTQPPQl PPRTCJFWP^PTT A RV AM AFP 

SEATQSHSISSSSFGAEPSAPGGGGSPGACPAL 

GTKSCSSSCAVHDLIFWRDVICKTGFVFGTTLI 

MLLSLAAFS VIS VVSYLIL ALLS VTISFRIYKSV 

TQAVQKSEEGHPFICAYLDVD1TLSSEAFHNY 

MNAAMYHINRALKLIIRLFLVEDLVDSLiO.A 

VFMWLMTYVGAVFNGITLLILAELLIFSVPIV 

YEKYKTQIDHYVGIARDQTKSIVEKIQAKLPG 

IAKKKAE 


981 


2331 


A 


8308 


186 


1337 


TRMSRHEGVSCDACLKGNFRGRRYKCLICYD 
YDLCASCYESGATTTRHTTDHPMQCILTRVD 
FDLYYGGEAFSVEQPQSFTCPYCGKMGYTET 
SLQEHVTSEHAETSTEVICPICAALPGGDPNH 

VTnnPAAHT TT FWR APRHT DFSSG VRTTVRR 

MFHPGRGLGGPRARRSNMHFTSSSTGGLSSS 
QSSYSPSNREAMDPIAELLSQLSGVRRSAGGQ 
LNSSGPSASQLQQLQMQLQLERQHAQAARQ 1 
QLETARNATRRTNTS S VTTTITQSTATTN1 AN 
TESSQQTLQNSQFLLTRLNDPKMSETERQSM 
ESERADRSLFVQELLLSTLVREESSSSDEDDR 
GEMADFGAMGCVDIMPLDVALENLNLKESN 
KGNEPPPPPL 


982 


2332 


A 


8315 


1 


1004 


GSTHASADAWAQWFCTEALVMGAPVWYLV 

AAALLVGFILFLTRSRGRAASAGQEPLHNEEL 

AHAGRVAOPGPI FPFEPRAGGRPRRRRDLGS 

RLQAQRRAQRVAWAEADENEEEAVILAQEE 

EGN^EKPAETHLSGKIGAKKLRKLEEKQARKA 

QREAEEAEREERKRLESQREAEWKKEEERLR 

LEEEQKJEEEERKAREEQAQREHEEYLKLKEA 

FWEEEGVGETMTEEQSQSFLTEFINYIKQSK 

V VLLEDLASQ VGLRTQDTINR1 QDLL AEGTIT 

GVIDDRGKFIYITPEELAAVANFIRQRGRVSIA 

ELAQASNSLIAWGRESPAQAPA 


983 


2333 


A 


8320 


244 


1420 


RRR WRARGGL VPTL A W AE ATG A YVPGRDKP 
DLPTWKRNFRSALNRKEGLRLAEDRSKDPHD 
PHKIYEFVNSGVGDFSQPDTSPDTNGGGSTSD 
TQEDILDELLGNMVLAPLPDPGPPSLAVAPEP 
CPQPLRSPSLDNPTPFPNLGPSENPLKRLLVPG 
EEWEFEVTAFY" 1GRQVFQQTISCPEGLRLVGS 
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INVj. 01 

nucl- 
eotide 
seq- 
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SEQ ID 
or 

peptide 
seq- 
uence 


Met 

nOu 


SEQ 
in 

XJSSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
pepuue 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A*= Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenyl alanine, G=Glycine, FNHistidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, W=Tryptophan, 
Y=Tyrosine, X=Unlcnown, *=Stop codon, 
/^possible nucleotide deletion, \ = =possible 
nucleotide insertion 






1 








EVGDRTLPGWPVTLPDPGMSLTDRGVMSYV 

RHVLSCLGGGLALWRAGQWLWAQRLGHCH 

TYWAVSEELLPNSGHGPDGEVPKDKEGGVF 

DLGPFIVGSLGPPDLITFTEGSGRSPRYALWFC 

VGESWPQDQPWTFCRLVMVK\^VPTCLRALVE 

MARVG GAS SLENT VDLHISN SHPL SLTSDQ Y 

KAYLQDLVEGMDFQGPGES 


984 


2334 


A 


8321 


1 


1243 


ANMAPVEHVVADAGAFLRHAALQDIGKNIY 

TIREVVTEIRDKATRRRLAVLPYELRFKEPLPE 

YVRLVTEFSKKTGDYPSI.SATDIQVLALTYQL 

EAEFVGVSHLKQEPQKVKVSSSIQHPETPLHIS 

GFHLP YKJPKPPQETEKGHSACEPENLEFS SFM 

FWRNPLPNIDHELQELLIDRGEDVPSEEEEEEE 

NG FEDRKDDSDDDGGG WITPSNIKQIQQEL E 

QCDVPEDWVGCLTTDFAMQNVLLQMGLHV 

LAVNGMLIREARSYILRCHGCFKTTSDMSRV 

FCSHCGNKTLKKVSV'IVSDDGTLHMHFSRNP 

KVLNPRGLRYSLPTPKGGKYAINPHLTEDQRF 

PQLRLSQKARQKTNVFAPDYIAGVSPFVENDI 

SSRSATLQVRDSTLGAGRRRLNPNASRKKFV 

KKR 


9S5 


2335 


A 


S322 


352 


529 


RRNNIRQFIMKVCISGQARWLTPVVPVLWET 
EAGRSLELKSLRPAWATWGNPISTKINK 


986 


2336 


A 


8325 


89 


1172 


1CMNPTDIAJDTTLDESIYSNYYLYESIPKPCTKE 

GIKAFGELFLPPLYSLVFVFGLLGNSVVVLVL 

FKYKRLRSMTDVYLLNLAISDLLFVFSLPFWG 

Yy AADQWVFGLGLCICIvnSWMYLVGFYSGIF 

FVMLMSIDRYLA1VHAVFSLRARTL1YGVITS 

LATWSVAVFASLPGFLFSTCYTERNHTYCKT 

K Y SLN STT WK VLS SLEINILG L VIPLG IMLFC Y 

SM1IRTLQHCK2n[EKKNKAVKMIFAVVVLFLG 

FWTPYNiyLFLETLVELE\ r LQDCTFERYLDYA 

IQATETLAFVHCCLNPIIYFFLGEKFRKYILQL 

FKTCRGLF VLCQ YCGLLQI Y S AJDTPS SS YTQ S 

TMDHDLHDAL 


987 


2337 


A 


8326 


3 


470 


SLSAMRFLAATFLLLALSTAAQAEPVQFKDC 

GSVDGVIKEVNVSPCPTQPCQLSKGQSYSVN 

VTFTSNIQSKSSKAVVHG1LMGVPVPFPIPEPD 

GCKSGINCPIQKDKTYSYLNKXPVKSEYPSIK 

LWEWQLQDDKNQSLFCWEIPVQIVSHL 


988 


2338 


A 


8335 


1205 


323 


VIKMALAARLLPQFLHSRSLPCGAVRLRTPA 

VA£VRI^PSATLCYFCRCRLGLG.AALFPRSAR 

ALAASALPAQGSRWPVLSSPGLPAAFASFPAC 

r vK5> Y z> I fc.bK rVjQHQK. I KJvll VLCjFbNPIN WV 

RTRIKAFL1 W AYFDKEFSITEFSEG AKQ AFAH 

VSKLLSQCKFDLLEELVAKEVLHALKEKVTS 

LPDNHKNALAANIDEIVFTSTGDISIYYDEKG 

RKFVNILMCFWYLTSANIPSETLRGASVFQVK 

LGNQNVETKQLLSASYEFQREFTQGVKPDWT 

IARIEHSKLLE 


989 


2339 


A 


8349 


67 


185 


KSLHPMS 


990 


2340 


A 


8361 


210 


1115 


ASPFLRPQGHDSGEREPFSQTPGLMQPFSIPVQ 

ITLQGSRRRQGRTAFPASGKKRETDYSDGDPL 

DVHKIU-PSSTGEDRAVMLGFAMMGFSVLMF 

FLLGTTILKPFMLSIQREESTCTA1HTDIMDDW 

LDCAFTCGVHCHGQGKYPCLQVFVNLSHPG 

QKALLHYNEEAVQINPKCFYTPKCHQDRNDL 

LNSALDIKEFFDHKNGTPFSCFYSPASQSEDVI 
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ng to first 
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nucleotide 
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Lo last amino 
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of peptide 
sequence 


Amino acid sequence (A-Alanme C=Cysteine 5 
]j=A.spartic aciq, o— oiuLainiv/ /\^iu, 
F=Phenylalanine 5 G=Glycinc, H^Histidinc, 
l=Isoleucine, K=Lysine 3 L=Leucine, 

\A — K/ff»*li inning Kf= A vnjirHiJine P = Proline. 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan 5 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














LIKKYDQMAIFHCLFWPSLTLLGGALIVGMV 
RLTQHLSLLCEKYSTVVRDEVGGKVPYIEQH 
OFKLCIMRRSKGRAEKS 


991 


2341 


A 


8369 


9 


921 


"SSVVEFSALSVSMACLSPSQLQKFQQUUi'LVL 

burLoAttL V Alvlk^ vlvl vJOl v ayIj1v.il/ v r an. i 

EFSTQEEEQLRAQGSTDYFLSSGDKIRFFFEK 

GVFDEKGNFLVPPEKSINKIGHALHAHDPVFK 

S1THSFKVQTLARSLGLQMPVVVQSMYTFKQP 

HFGGEVSPHQDASFLYTEPLGRVLGVWIAVE 

DATLENGCLWFIPGSHTSGVSRRMVRAPVGS 

APGTSFLGSEPARDNSLFVPTPVQRGALVLIH 

GEVVHKSKQNLSDRSRQAYTFHLMEASGTT 

W SPEN WLQPT AELPFPQL YT 


992 


2342 


A 


8370 


906 


4 


MALSGNCSRYYPREQGSAVPNSFPEWELNV 

/^r>A\7vrTDHOTl TQTPT4QT T WTC\zTP^PKT?r>TAN 

GGQVYr I Krio 1 .Liolr rlol^JLWivLvir orrs^ro-/ x /ajn 

DL AKD SKGRFFIDRDGFLFRYILD YLRDRQ W 

LPDHFPEKGRLKREAEYFQLPDLVKLLTPDE1 

KQSPDEFCHSDFEDASQGSDTR1CPPSSLLPAD 

RKWGFITVGYRGSCTLGREGQADAKFRRVPR 

ILVCGRISLAKEVFGETLNESRDPDRAPERYTS 

RFYLKFKHLMGAPASNFILGFWGLGQNQDK 

HPVN1YLQQRSVIRPDLTSKKAGDLKGKGDA 

QEV SRRRRWLGDPEHL 


993 


2343 


A 


S379 


1 


2794 


MRMQRHKNDTMDFGDSGKRIGGGVLCLLHQ 

SNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIE 

DMVTTASTYLFEATEKRFFFKNVSILIPENWK 

ENPQYKRPKHENHKHADVIVAPPTLPGRDEP 

YTKQFTECGEKGEYIHFTPDLLLGKKQNEYG 

PPGKLFVHEWAHLRWGVFDEYNEDQPFYRA 

KSKKIEATRCSAGISGRNRVYKCQGGSCLSRA 

CRIDSTTKLYGKDCQFFPDKVQTEKASIMFM 

QSIDSVVEFCNEKTHNQEAPSLQNIKCNFRST 

WEVISNSEDFKNTIPMVTPPPPPVFSLLKIRQRI 

VCLVLDKSGSMGGKDl^NRMNQAAKHFLLQ 

TVENGSWVGMVHFDSTATIVNICLIQIKSSDER 

NTLMAGLPTYPLGGTSICSGIKYAFQVIGELH 

SQLDGSEVLLLTDGEDNTASSC1DEVKQSGAI 

VHFIALGRAADEAVIEMSKITGGSHFYVSDEA 

QNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 

LNSNAWMNDTVIIDSTVGKDTFFLIT\¥NSLPP 

S1SLWDPSGTIMENFTVDATSKMAYLSIPGTA 

KVGTWAYNLQAICANPETLTITVTSRAANSSV 

PPITVNAKMNKDVNSFPSPMIVYAEILQGYVP 

VLG AN VTAFIES QNGHTE VLELLDNG AGADS 

i-i^KTr»r i \rvQT?vrTAVTP"KrrTRY c; iT KVRAHGGA 

NTARLKLRPPLNRAAY1PGWVVNGEIEANPP 

RPEIDEDTQTTLEDFSRTASGGAFVVSQVPSL 

PLPDQYPPSQITDLDATVHEDKIILTWTAPGD 

NFDVGKVQRYIIRISASILDLRDSFDDALQVN 

TTDLSPKEANSKESFAFKPENISEENATHIFIAI 

KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPT 

PTPTPTPTPDKSHNSGVNISTLVLSVIGSWIY 

NFILSTTI 


994 
995 


2344 
2345 


A 
A 


8385 
8390 


231 
194 


644 
3421 


" INSSPRTGRDHQELNLHTERDSRSQRAVLKIP 
RQNPGIFYWIFLPSRSHSASHGSRQRQVSCQG 
TQDE1LK^4RNTFAELKNSLEALSSRMDQAEE 
RIGTQAGVQWRDHGSLQPQPPEFKQCFHLSL 
PSSWDYRACLS 

" " AWRKSSVVPPRGTRI GEKSDQDKSGQKNKR 



289 



MSDOCID <WO 0157188A2_I_> 



WO 01/57188 




T/USO 1/03800 



SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 
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Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid. 
F=Phenylalanine, G=Glycine, H=Hislidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine } W=Tryptophan, 
Y=Tyrosine. X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














DFLSMKQSPALAPEERCRRAGSPKPVLRAJDD 

NNMGNGCSQKLATANLLRFLLLVLIPCICALV 

LLLEILLSYVGTLQKVYFKSNGSEPLVTDGEI 

QGSDVILTNTIYNQSTVVSTAPIPDQHVPAWT 

TDASLPGDQSHRNTSACMN1THSQCQMLPYH 

ATLTPLLSVVRNMEMEKFLKFFTYLHRLSCY 

QHIMLFGCTLAFPECIIDGDDSHGLLPCRSFCE 

AAKEGCESVLGMVNYSWPDFLRCSQFRNQT 

ESSNVSRICFSPQQENGKQLLCGRGENFLCAS 

GICIPGKJLQCNGYNDCDDWSDEAHCNCSENL 

FHCHTGKCLNYSLVCDGYDDCGDLSDEQNC 

DCNPTTEHRCGDGRC1AMEWVCDGDHDCVD 

KSDEVNCSCHSQGLVECRNGQCIPSTFQCDG 

DEDCKDGSDEENCSVIQTSCQEGDQRCLYNP 

CLDSCGGSSLCDPNNSLNNCSQCEPITLELCM 

NLPYNSTSYPNYFGHRTQKEASISWESSLFPA 

LVQTNCYKYLMFFSCTII.VPKCDVNTGEHIPP 

CRALCEH SKERCES VLG 1 VG LQ WPEDTDCSQ 

FPEENSDNQTCLMPDE Y VEECSPSHFKCRSG Q 

CVLASRRCDGQADCDDDSDEENCGCKERDL 

WECPSNKQCLKHTVICDGFPDCPDYMDEKN 

CSFCQDDELECANHACVSRDLWCDGEADCS 

DSSDEWDCVTLSFNVNSSSFLMVHRAATEHH 

VCADGWQE1LSQLACKQMGLGEPSVTKL1QE 

QEKEPRWLTLHSNWESLNGTTLHELLVNGQS 

CESRSKISLLCTKQDCGRRPAARMNKRILGGR 

TSRPGRWPWQCSLQSEPSGH1COCVLIAKKW 

VLT V AHCFEGREN AA V WK V VLG INNLDHPS 

VFMQTRFVKTIILHPRYSRAVVDYDISIVELSE 

DISETGYVRPVCLPNPEQWLEPDTYCYITGW 

GHMGNKMPFKLQEGEVRHSLEHCQSYFDMK 

TITTRMICAGYESGTVDSCMGDSGGPLVCEK 

PGGRWTLFGLTSWGSVCFSKVLGPGVYSNVS 

YFVEWIKRQIYIQTFLLN 


996 


2346 


A 


S3 92 


199 


3085 


KV1LSSEMSKTNKSKSGSRSSRSRSASRSRSRS 

FSKSRSRSRSLSRSRKRRLSSRSRSRSYSPAHN 

RERNHPRVYQNRDFRGHN RG YRRP Y YFRGR 

NRGFYPWGQYNRGGYGNYRSNWQNYRQAY 

SPRRGRSRSRSPKRRSPSPRSRSHSRNSDKSSS 

DRSRRSSSSRSSSNHSRVESSKRKSAKEKKSSS 

KDSRPSQAAGDNQGDEVKEQTFSGGTSQDTK 

ASESSKPWPDATYGTGSASRASAVSELSPRER 

SPALKSPLQSVVVRRRSPRPSPVPKPSPPLSST 

SQMGSTLPSGAGYQSGTHQGQFDHGSGSLSP 

SKKSPVGKSPPSTGSTYGSSQKEESAASGGAA 

YTKRYLEEQKTENGKDKEQKQTNTDK£KIKE 

KGSFSDTGLGDGKMKSDSFAPKTDSEKPFRG 

SQSPKJ^YKl^RDDFEKKMADFHKEEMDDQDK 

DKAKGRKESEFDDEPKFMSKVIGANKNQEEE 

KSGKWEGLVYAPPGKEKQRKTEELEEESFPE 

RSKKEDRGKRSEGGHRGFVPEKNFRVTAYK 

AVQEKSSSPPPRKTSESRDKLGAKGDFPTGKS 

SFS1TREAQVNVRMDSFDEDLARPSGLLAQER 

KLCRDLVHSNKKEQEFRSIFQHIQSAQSQRSP 

SELFAQHIVTIVHHVKEHHFGSSGMTLHERFT 

KYLKRGTEQEAAKNKKSPEIHRRIDISPSTFRK 

HGLAHDEMKSPREPGYKAEGKYKDDPVDLR 

LDIERRKKHKERDLKRGKSRESVDSRDSSHSR 

ERS AEK.TEKTHKG SKKQKJCHRRARDRSRSS S 

SSSQSSHSYKAEEYTEETEEREESTTGFDKSRL 
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to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A= Alanine C=Cysteine, 

L)==Aspartic aciu, oiuianii^ ^l-iu, 

F=Phenylalanine, G=Glycine, H=Histidine, 

I=Isoleucine 5 K=Lysine, L=Leucine, 

M^Methionine, N=Asparagine, P=Proline, 

Q=GIutamine, R=Arginine, S=Serine, 

tt— oAnino \/=sVnlinp W=Trvr>tot)han. 
I — i nreonine, v — > <iimc, w i ^ivpii«**»j 

Y=Tyrosine, X=Unknown, *=Stop codon, 

/=possible nucleotide deletion, V=possible 

nucleotide insertion 














r^TT^rkPvr^p^FR nfrHR 4.RGTFOFRARGRGWG 
RGNYSGNNNNNSNNDFQKRNREEEWDPEYT 
PKSKK Y YLHDDRE GEG SDICWV S RGRGRG AF 
PRGRGRFMFRKSSTSPKWAHDKFSGEEGEIE 
DDESGTENREEKDNIQPTTE 


997 


2347 


A 


839S 


202 


552 


r pAi r,r,pom onTRT T W 4FIDSGVGGOKAKS 
KQENLESLEATGREEEGGQGPPVTTKGVLLA 
LLMAGL ALQPGTALLC YSCKAQ V SNEDCLQ 
VENCTQLGEQCWTARIREWGDDSRQA 


998 


2348 | 


A 


8400 


697 


301 


NPPSACTPGSCDSCSGRGRDLAFDSYWSTNN 

MSDPRRPNKVLRYKPPPSECNPALDDPTPDY 

MNLLGMIFSMCGLMLICLKWCAWVAVYCSFI 

SFANSRSSEDTKQMMSSFMLSISAWMSYLQ 

NPQPMTPPW 


999 


2349 


A 


8401 


93 


1126 


ASASHITSGHLRCFPGSEGVGTMARCFSLVLL 
LTSIWTTRLLVQGSLRAEELSIQVSCRIMG1TL 

VETALKASFETCSYG W VGDGFVV1 SRI SPNPK 
CGKNGVGVLIWKVPVSRQFAAYCYNSSDTW 
TNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYS 
VASPYSTIPAPTTTPPAPASTSIPRRKKLICVTE 
VFMETSTMSTETEPFVENKAAFKNEAAGFGG 
VPTALLVLALLFFGAAAGLGFCYVKRYVKAF 
PFTNKNQQKEMIETKVVKEEKANDSNPNEES 
KKTDKNPEESKSPSKTTMRCLEAEV 


1000 


2350 


A 


8406 


2 


777 


ICE R C Q r V V rvl JVLL o 1 VUorLvlJLVJNiiLAiSAJirv i 
AAIFTADGNMISASTLMDILLMNDFKLVINK1 
AYDVQCPKREKPSNEHTAEMEHMKSLVHRL 
FTILKLEESQKKREHHLLEKIDHLKEQLQPLE 
QVKAG1EAHSEAKTSGLLWAGLALLSIQGGA 
LAWLTWWVYSWDIMEPVTYFITFANSMVFF 
AYFIVTRQDYTYSAVKSRQFLQFFHKKSKQQ 
HFDVQQYNKLK£DLAKAKESLKQARHSLCL 
QMQVEELNEKN 


1001 


2351 


A 


8410 


1400 


264 

- 


VGFWERPLRSSRWFRRSLRRWEMLARAARG 
TGALLLRGSLLASGRAPRRASSGLPRNTVVLF 
VPQQEAWVVERMGRFHRILEPGLNILIPVLDR 
IRYVQSLKEIVINVPEQSAVTLDNVTLQIDGV 

RSELGKLSLDKVFRERESLNASIVDAINQAAD 

CWGIRCLRYEIKDIHVPPRVKESMQMQVEAE 

RRKRATVLESEGTRESAINVAEGKKQAQILAS 

EAEKAEQINQAAGEASAVLAKAKAKAEAIRI 

LAAALTQHNGDAAASLTVAEQYVSAFSKLA 

KDSNTILLPSNPGDVTSMVAQAMGVYGALT 

KAP VPGTPD SLSSG S SRD VQGTD ASLDEELDR 

VKMS 


1002 


2352 


A 


8121 


134 


941 


— \id ittvtt t ccDA/iiwinprQvnvni "RTTTsJFPHlxTY 
NKENLLrJioKJVliVlL-'rC-o VU v v^livi i iNij^rxrv i i 

YTRHTGFKTLQELSSNDMLLLQLRTGMTLSG 

NNTICFHHVKIYIDRFEDLQKSCCDPFNIHKKL 

AKKNLHVIDLDDATFLSAKFGRQLVPGWKLC 

PKCTQUNGSVDVDTEDRQKRKPESDGRTAK 

ALRSLQFTNPGRQTEFAPETGKREKRRLTKN 

ATAGSDRQVIPAKSKVYDSQGLLIFSGMDLC 

DCLDEDCLGCFYACPACGSTKCGAECRCDRK 

WLYEQBEIEGGEIIHNKHAG 


1003 


2353 


A 


8427 


3 


1416 


TEWGLSGSCPGCSPLEPGSRGRGAAAWRILR 
CRRLPEPSPFLTQPNLAQSQPPAPVPVTDPSVT 
MHPAVFLSLPDLRCSLLLLV fWVFTPVTTEIT 
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VIA. rt-T 
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seq- 
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iNU. or 
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seq- 
uence 


Met 
nod 


SEQ 

WJ INU. 

in 
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914 
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beginning 

nucleotide 
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correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to iast amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F=Phenyl alanine, G=Glycine, H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine a N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valinc, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *^=Stop codoo, 
/=possible nucleotide deletion, ^possible 
nucleotide insertion 














SLDTENIDEILNNADVALVNFYADWCRFSQM 

LHPIFEEASDVIKEEFPNENQVVFARVDCDQH 

SD1AQRYRISKYPTLKLFRNGMN4MKREYRGQ 

RSVKALADYIRQQKSDPIQEIRDLAEITTLDRS 

KRNIIGYFEQKDSDNYRVFERVANILHDDCAF 

LSAFGDVSKPERYSGDNIIYKPPGHSAPDMVY 

LGAMTNFDVTYNW1QDKCVPLVREITFENGE 

ELTEEGLPFLILFILVfKEDTESLEIFQNEVARQL 

ISEKGTrNFLHADCDKFRHPLLHIQKTPADCP 

VIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDI, 

HSGKLHREFHHGPDPTDTAPGEQAQDVASSP 

PES SFQKL APSE YRYTLLRDRDEL 


1004 


2354. 


A 


8432 


910 


387 


GLSRKLRAGFLPGFCRVSPCGSWVVETLVKM 

ACAAARSPADQDRFICIYPAYLNNKKTIAEGR 

RJPISKAVENPTATEIQDVCSAVGLNVFLEKN 

KMYSREWNRDVQYRGRVRVQLKQEDGSLC 

LVQFPSRKSVMLYAAEMfPKJLKTRTQKTGGA 

DQSLQQGEGSKXGKGKKKK 


1 firths 




A 




90 


530 


QSHETICMQSGTHWRVLGLCLLSVGVWGQD 

GNEEMGGITQTPYKVSISGTTVILTCPQYPGSE 

ILWQHNDKNIGGDEDDKNIGSDEDHLSLKEF 

SELEQSGYYVCYPRGSKPEDANFYLYLRARG 

NPGLQNRYHRLFREDHSKGHSQ 


1006 


2356 


A 


8458 


3 


307 


AVQRIRHEMNfFRLTGDLSHLAAIVILLLKIW 
KTRSCAGISGKSQLLFALWTTRYLDLFTSFIS 
LYNTSMKVWYAIHRNVFHLQCTGLWTLNLC 
QLCrFN 


1007 


2357 


A 


8459 


43 


553 


GAGAGGDWAAMDKJLICKVLSGQDTEDRSGL . 

SEVVEASSLSWSTRIKGFIACFAIGILCSLLGT 

VLLWVPRKGLHLFAVFYTFGN1ASIGSTIFLM 

GPVKQLKRMFEPTRJLIATIMVLLCFALTLCSA 

FWWFINKGLALIFCILQSLALTWYSLSFIPFAR 

DAVKKCFAVCLA 


1008 


2358 


A 


8462 


487 


150 


AQDIRSVHSLGQKSTFVKHFRTLSHLHGLPDP 
PPHWPPQERSPPSHPCMPSHRPQIPQLSNSGPS 
DPR WG C VG PSMPTSTCLPG A VE ASTTKA SLP 
KCPVDSSLPTPEACFL 


i f\no \ 
I UUV 




A 


8465 


134 


954 


ETR VKTSLELLRTQLEPTGT VGNTIMTS QP VP 

NETIIVLPSNVINFSQAEKPEPTNQGQDSLKKH 

LHAEIKVIGTIQ1LCGMM\0.SLGIILASASFSPN 

FTQVTSTLLNSAYPFIGPFFFIISGSLSIATEKRL 

TfCLLVHSSLVGSILSALSALVGFIILSVKQATL 

NPASLQCELDKNNIPTRSYVSYFYHDSLYTTD 

CYTAXCASLAGTLSLMLICTLLEFCLAVLTAVL 

R WKQA Y S DFPGS VJLFLPHS YIGNS GMS S KMT 

HDCGYEELLTS 


1010 


2360 


A 


8468 


2 


473 


KYRYRRPYPVMRKICQVGPAGLAFILNISPVA 
HRVALCHLAGCQEQAAWYHTLQrLFFLVSAY 
FFSCPVPEKYFPGSCDIVGHGHQ1FFIAFLSICT 
LSQLEAILLDYQGRQEIFLQRHGPLSVHMACL 
SFFFLAACSAATAAI I RHTv VTsf ART Tk'KT)^ 


1011 


2361 


A 


8478 


5 


409 


TELSQLEI<AHPPADMGRRKSKRKPPPKKKMT 
GTLETQFTCPFCNHEKSCDVKMDRARNTGVI 
SCTVCLEEFQTPITCILGNLGFFQRVGRGLESG 
PCSSGPLCAL VQGQSRPEEQ VPPSDFCG VRRC 
RAGFQCQ 


1012 


2362 


A 


8481 


2810 


1652 


RTSTQKWQSVFNDSQEHLERFYCNPENDRM 
RMKYGGQEFWADLNAMKVYETTEFDQLRR 
LSTPPSSNVNSIYHTVWKFFCRDHFGWREYPE 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID | 1 
NO: of 1 
peptide 
seq- 
uence 


Viet 
bod 


SEQ 

[DNO: 

in 
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09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 
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nucleotide 
location 
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to last amino 
acid residue 
of peptide 
sequence 


Ajnino acid sequence (A=Alamne C=Cysteine, 
D=Aspartic Acid, E~Glutamic Acid, 
F=Phenylalaninc 3 G=Glycinc, H=Histidinc, 
l=lso leucine, K=Lysine, L=Leucine, ! 
M^Methiontne, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 
















SVTRI JEEANSRG1XEVRPMMWNNHYILHNS 

PPL EATS S SQIICPDG VTS ANF YPETWVYMHP 

SQDFIQVPVSAEDKSYR1IYNLFHKTVPEFKYR 

ILQ1LRVQNQFLWEKYKRKKEYMNRKMFGR 

DRUNERHLFHGTSQDWDGICKHNFDPRVCG 

KHATMFGQGSYFAKKASYSHNFSKKSSKGV 

i iv7k /TT7T w\n Tr:T? VTlwrn^HnMRPPPPVNPGS 

VTSDLYDSCVDNFFEPQ1FV1FNDDQSYPYFVI 

QYEEVSNTVS1 




1013 


2363 


A 


848S 


2 


517 


" IENCRTRLRQAWHEVCGNKMAAPIPQOFSCL 
oner /-»n;i\/cr>rvD\/T VTO^ A ATVTATRTKKRFTP 
PIYQPKFKTEKJEFMQHARKAGLVIPPEKSDRS 
1HLACTAG1FDAYVPPEGDARISSLSKEGLIER 
TERMKKTMASQVSIRRIICDYDANFKIKDFPE 
KAKD1FIEGSPLY 


1014 


2364 


A 


8501 


363 


17 


YIRTGYVY1CHYAQLMYTYYIRTAYVY ICILY 
AQLMYTYVLYTHSLCIHMYSIRTAYVY1C11Y 
AQIMYTYVFYTHRLC1HMYS1RTDYVYICILY 
AOLMYTYVFYTHSYMSDE _J 


1015 


2365 


A 


8504 


3 


2190 


" NSSEHFSQAPQRLSFY SW YGS ARLFRFRVPPD 
AVLLRWLLQVSRESGAACTDAEITVHFRSGA 
PPVTNPLGTSFPDDTAVQPSFQVGVPLSTTPRS 
NASVNVSHPAPGDWFVAAHLPPSSQKIELKG 
LAPTCAYVFQPELLVTRVVE1SIMEPDVPLPQ 
TLLSHPSYLKVFVPDYTRELLLELRDCVSNGS 
LGCPVRLTVGPVTLPSNFQKVLTCTGAPWPC 
RLLLPSPPWDRWLQVTAESLVGPLGTVAFSA 
VAALTACRPRSVTIQPLLQSSQNQSFNASSGL 
LSPSPDHQDLGRSGRVDRSPFCLTNYPVTRED 
MD V V S VHFQPLDR V S VRV CSDTP S VMRLRL 
NTGMDSGGSLTISLRANKTEMRNETVVVACV 
NAASPFLGFNTSLNCTTAFFQGYPLSLSAWSR 
RANLIIPYPETDNWYLSLQLMCPENAEDCEQ 
AVVHVETTLYLVPCLNDCGPYGQCLLLRRHS 

YLYASCSCJvACj WKuWoti jjins i v-tt.v.jv^ 

AATLLLTLSNLMFLAPLA.VSVRRFFLVEASVY 

AYTMFFSTFYHACDQPGEAVLCILSYDTLQY 

CDFLGSGAAIWVTILCMARLKTVLKYVLFLL 

GTLVIAMSLQLDRRGMWNMLGPCLFAFVIM 

a px <y\r a vd prui>P nPVPT<sWHR W AFY1 .LPG 
ASM WAY KCCiriKKi^t- iriow^r\v\ s\r uibrvi 

VSMASVGIAIYTSMMTSDNYYYTHSIWHILL 
AGSAALLLPPPDQPAEPWACSQKFPCHYQIC 
KNDREELYAVT 


1016 


2366 


A 


8511 


1 


453 


KW YPSGPVRIPGRFYYKLPAGHRRCRMAPAK 

KGGEKKKGRSAINEVVTREYTmiHKRIHGVG 

FKKRAPRALKEIRKFAMKEMGTPDVRIDTRL 

NKAVWAKGIRNVPYRIRVRLSRKRNEDEDSP 

NKLYTLVTYVPVTTFKNLQTVNVDEN 


1017 


2367 


A 


8513 


54 


1196 


" LERTPASADMAWTKYQLFLAGLMLVTGSINT 
LSAKWADNFMAEGCGGSBCEHSFQHPFLQAV 
GMFLGEFSCLAAFYLLRCRAAGQSDSSVDPQ 
QPFNPLLFLPPALCDMTGTSLMYVALNMTSA 
SSFQMLRGAVIIFTGLFSVAFLGRRLVLSQWL 
GILATIAGLWVGLADLLSKIIDSQHKLSEVIT 
GDLLI1 N4AQII VMQMVLEEKFVYKHN VHPLR 
AVGTEGLFGFVILSLLLVPMYYIPAGSFSGNP 
RGTLEDALDAFCQVGQQPLIAVALLGNISSIA 
FFNFAGISVTKELSATTRMVLDSL_ITWIWAL 
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nuclpnt iHf* insprtidn 

UUVlvUllUw llldwJ 














SL ALGWEAFH ALQILGFLILLIGTAL YNGLHR 
PLLGRJLSRGRPLAEESEQERLLGGTRTPINDA 
S 


1018 


2368 


A 


8518 


324 


694 


SPFWTEKRRMEKPLrPLVPLHWFGFGYTALV 

VSGGIVGYVKTGSVPSLAAGLLFGSLAGLGA 

YQT..YQDPRNVWGFI.AATSVTFVGVMGMRS 

YYYGKFMPVGLJAGASLLMAAKVGVRMLM 

TSD 


1019 


2369 


A 


8526 


2 


1787 


VSAAAVNMEPPDAPAQARGAPRLLLLAVLL 

AAHPDAQAEVRLSVPPLVEVMRGKSVILDCT 

PTGTHDHYMLEWFLTDRSGAR.PRLASAEMQ 

GSELQVTMHDTRGRSPPYQLDSQGRJLVLAEA 

QVGDERDYVCVVRAGAAGTAEAAARLNW 

AKPEATEVSPNKGTLSVMEDSAQEIATSNSRN 

GNPAPKITWYRNGQRLEVPVEMNPEGYMTS 

RTVREASGLLSLTSTLYLRLRKDDRDASFHC 

AAHYSLPEGRHGRLDSPTFHLTLHYPTEHVQ 

FWVGSPSTPAGWVREGDTVQLLCRGDGSPSP 

EYTLFRLQDEQEEVLNVNLEGNLTLEGVTRG 

youJ Y vjCKVJbU r UAAi/L'V( v ;L5)rvi LbLKVAY 

LDPLELSEGKVLSLPLNSRAWNCSVHGLPTP 

ALRWTKDSTPLGDGPMLSLSS1TFDSNGTYVC 

EASLPTVPVLSRTQNFTLLVQGSPELKTAEIEP 

KADGSWREGDEVTLICSARGHPDPKLSWSQL 

GGSPAEPIPGRQGWVSSSLTLKVTSALSRDGI 

SCEASNPHGNKRHVFHFGTVSPQTSQAGVAV 

MAVAVSVGLLLLVVAVFYCVRRKGGPCCRQ 

RREKGAP 


1020 


2370 


A 


8530 


2 


. 1200 


PRVRLLRPSRSRSCRGLLSTRAPGPSPFRSLHS 

SPLLPHAMKSPFYRCQN1TSVEKGNSAVMGG 

VLFSTGLLGNLLALGLLARSGLGWCSRRPLR 

PLPSVFYMLVCGLTVTDLLGKCLLSPWLAA 

I AVJNKo-LK Vl^AJ^ALlJNoLCQAr At rMbr r CjJL 

SSTLQLLAMALECWLSLGHPFFYRRHITLRLG 

ALVAPWSAFSLAFCALPFMGFGKFVQYCPG 

TWf^TTTOA/rVT-IP'P'riQT QV1 f^V^/I VQQ1' AA A 1 1 \/ 
1 W V^r iV^ivl V nC-HUoLL!) V JLO IoVLI oo1_»ivJ/\JL,1_, V 

LATVLCNLGAMRNLYAMHRRLQRHPRSCTR 
DCAEPRADGREASPQPLEELDHLLLLALMTV 
LFTMCSLPVIYRAYYGAFKDVKEKNRTSEEA 
EDLRAERFL S VIS I VDP WI FI [FRSP VFRJFFHKI 
FIRPT R YRSRC'SN^TNJMF^Sf 


1021 


2371 


A 


8536 


1 


237 


RRGEIDMATEGDVELELETETSGPERPPEKPR 
KHDSGAADLERVTDYAEEICEIQSSNLETAMS 
V1GDRRSREQKAKQER 


1022 


2372 


A 


8537 


94 


541 


RKERRRRRRRMEAVVFVFSLLDCCALIFLSV 

I rill LjULCL-U I 1 INMKlM A 1 ,l\ K VV V i± ILL>1 v_J 

HT1VTVLLLMSLHWFIFLLNLPVATWNIYRYI 
M VPSGNMG VFDPTEIHN RGQLKSHMKE AMI 
KLGFHLLCFFMYLYSMILALIND 


1023 


2373 


A 


8540 


26 


431 


RMMKCPQALLAIFWLLLSWVSSEDKVVQSPL 

SLVVHEGDTXTTLNCSYEVTNFRSLLWYKQEK 

KAPTFLFMLTSSGIEKKSGRLSSILDKKELSSIL 

NITATQTGDSAIYLCAVEAQCSLVTCSLYSNS 

TAEALQL 


1024 


2374 


A 


8544 


1731 


743 


GVRLRYSPIAVVMVGEAGRDLRRRRAVAVT 

AEKMAVLAPLIALVYSVPRLSRWLAQPYYLL 

SALLSAAFLLVRKLPPLCHGLPTQREDGNPCD 

FDWREVEILMFLSA1VMMKNRRSITVEQHIGN 

IFMFSKVANTILFFRLDIRMGLLYITLCrVPLM 
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Y=Tyrosine, X=Unknown, *=Stop cod on, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














TCKPPLYMGPEYIKYFNDKTIDEELERDKRVT 
WIVEFFAN W SNDCQ SF API Y ADLSLK YN CTG 
LNFGKVD VGRYTD VSTRYK V STSPLTKQLPT 
LILFQGGKbAMlvivl i^UUrU^UKA v o Wli oL^n 
VIREFNLNELYQRAKKLSKAGDNIPEEQPVAS 
TPTTVSDGENKKDK 


1025 


2375 


A 


8546 


2194 


1707 


TVSFHKTMASLKCSTVVCVICLEKPKYRCPA 
CRVPYCSVVCFRKHKEQCNPETRPVEKKIRS 
ALPTKTVKPVENKDDDDSIADFLNSDEEEDR 

\Tdr /-\VTT rr-v 1T rrc ATT ncr i T "NTPWFT ROT NffVTvII 

VSEQNLKNLGEoA I LK&LLLN rnL,i\\^L,ivL v inl, 
DQGEDKAKLMRAYMQEPLFVEFADCCLGIV 
EPSQNEES 


1026 


2376 


A 


8547 


1078 


594 


VGMELPAVNLKVILLGH WLLTT W GCI VFSGS 

■\r x n 7 a ■vttt'TT at 1 T\\ T A A 7 A f"YD PlCTFt A T QN/TFI CXd 

Y A WAN r IIL ALCj V WA V A^KJJoHJAlomrLUO 
LLATIFLDIVHISIFYPRVSLTDTGRFGVGMAIL 
SLLLKPLSCCF V YHM YRERG GELL VHTGFLG 
SSQDRSAYQTIDSAEAPADPFAVPEGRSQDAR 
GY 


1027 


2377 


A 


8557 


1 


340 


DFLGPASPQEEGGSESSTMTELETAMGMIIDV 
FSRYSGSEGSTQTLTKGELKVLMEKELPGFLQ 
SGKDKDAVDKLLKDLDANGDAQVDFSEFIVF 
VAAITSACHKYFEKAGLK 


1028 


2378 


A 


S569 


20 


963 


KMAATLGPLGS WQQ WRRCLS ARDG S RRL LL 

LLLLGSGQGPQQVGAGQTFEYLKREHSLSKP 

YQGEAPRPCFLRD WELQ VHFKIHGQGKJCN L 

HGDGLAIWYTKDRMQPGPVFGNMDKFVGLG 

VF VDTYPNEEKQQERVFP Y I S AM VNN G S LS Y 

DHERDGRPTELGGCTAI VRNLHYDTFE VI R Y 

VKRHLTIMMDIDGKHEWRDCIEVPGYRLPRG 

YYFGTSSITGDLSDNHDVISLKLFELTVERTPE 

EEKLHRDVFLPSVDNMKLPEMTAPLPPLSGL 

ALFLI\TFSLVFS VFAI VIGIIL YNK WQb^bKJv 

RFY 


1029 


2379 


A 


8572 


1 


578 


AAAASHRSRARSRPRRVSSGPAPRRAQSSAG 

RVASGLDSAPLCTMARALCRLPRRGLWLLLA 

HHLFMTTACQEANYGALLRELCLTQFQVDM 

EAVGETLWCDWGRTIRSYRELADCTWHMAE 

KLGCFWPNAEVDRFFLAVHGRYFRSCP1SGR 

AVRDPPGSILYPFIVVPITVTLLVTALVVWQS 

KRTEGIV 


1030 


23S0 


A 


8574 


1352 


372 


DSSTVKGGSESRHLCLIPDLKGKARTREASSG 

SRTCGRRTSLCTSAKSSWTYRSGRLSWQSIKG 

THLTITQALRQPLHRAPLLPGQLC W brKrLfc/K 

NKAMGRPLLLPLLLLLQPPAFLQPGGSTGSGP 

SYLYGVTQPKHLSASMGGSVEIPFSFYYPWEL 

AIVPNVRISWRRGHFHGQSFYSTRPPSIHKDY 

VNRLFLNWTEGQESGFLRISNLRKEDQSVYF 

CRVELDTRRSGRQQLQSIKGTKLTITQAVTTT 

TTttmnccTTTT APT 13 'X/TTTCl/ f^UCFQYWTJl QT T~YT 

AIRVALAVAVLKTVILGLLCLLLLWWRRRKG 
SRAPSSDF 


1031 


2381 


A 


8580 


905 


340 


RRTAGIYPCFPKPGRTRHALCSVVLLLLTGQL 

AFDDFQESCAMMWQKYAGSRRSMPLGARIL 

FHGVFYAGGFAIVYYLIQKFHSRALYYKLAV 

EQLQSHPEAQEALGPPLNIHYLKLIDRENFVDI 

VDAKLKIPVSGSKSEGLLYVHSSRGGPFQRW 

HLDEVFLELKDGQQEPVFKLSGENGDEVKKE 


1032 


2382 


A 


8593 


2558 


961 


I^RPRLLPGAEPCEPRVGPRRADMGCSAKAR 
WAAGALGVAGLLCAVLGAVMIVMVPSLIKQ 
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Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possiblc 
nucleotide insertion 














QVLKNVRIDPSSLSFNMWKErPIPFYLSVYFFD 

VMNPSEJLKGEKPQVRERGPYVYREFRHKSNI 

TFNNNDTVSFLEYRTFQFQPSKSHGSESDYIV 

M PNI L VLG AA VMMENKPMTLKLIX4TL AFTTL 

GERAFNINRTVGEIMWGYKDPLVNLINKY'FP 

GMFPFKDKFGLFAJELNNSDSGLFTGFTGVQNI 

i j rvi 1 1 j_. v \j cv w in v_i v ur vv jno J J wi^* iviain vj x o 
GQMWPPFMTPESSLEFYSPEACRSMKJLMYKE 
^GVFFGIPTYRFVAPKTI FA'NJO^TYPPTsJFfJFr'P 
CLESGIQNVSTCRFSAPLFLSHPHFLNADPVL 
AEAVTGLHPNQEAHSLFLDIHPVTGIPMNCSV 
KLQLSLYMKSVAGIGQTGKIEPVVLPLLWFA 
E SG A MEGETLHTFYTQL VLMPK VMHY AQ Y V 
LLALGCVLLLVPVICQIRSQEKCYLFWSSSKK 
GSKDKEAIOAYSE^LMTSAPKGSVT OFAICI 


1033 


2383 


A 


8595 


595 


767 


AHLPDTLLLPPHSPTVPTPKSFQCSQKACFSRS 
FCLLLSLVSSSLVSLSLCPPLTQA 


1034 


2384 


A 


8597 


640 


164 


VTTSCIIPFAFGLGVRASERLAE1DMPYLLKYQ 

PMMQTIGQKYCMDPAVIAGVLSRKSPGDKIL 

VNMGDRTSMVQDPGSQAPTSWISESQVFQTT 

FVT TTRTTFT ORRPPTWTPnOYT Rn^I pAVQr, 

GAGYVRSSQDLSCDFCNDVLARA1CYLKRHG 

F 


1035 


2385 


A 


8603 


936 


204 


AMASTLEYSPSPLRRLVGPAAGFSRAARADL 
S WDPMAFFTGLWGPFTC VSRVLSHHCFSTTG 
SLSAIQKMTRVRVVDNSALGNSPYHRAPRC1 

HCMPGPRMTPRFDSNNVVLIEDNGNPVGTRI 
KTPIPTSLRKREGEYSKVLA1AQNFV 


1036 


2386 


A 


8606 


1 


562 


PI RAHSFDLCC S PCRRRLLGREE AG E EPTSP V 

TQYLQPRSPEECKMFACAKLACTPSLIRAGSR 

VAYRPISASVLSRPEASRTGEGSTVFNGAQNG 

VSQLIQREFQTSAISRDIDTAAKFIGAGAATVG 

VAGSGAGIGTVFGSLIIGYARNPSLKQQLFSY 

AILGFALSEAMGLFCLMVAFULFAM 


1037 


2387 


A 


8615 


2 


2364 


SPGPSLPESAESLDGSQEDKPRGSCAEPTFTDT 

GMVAHINNSRLKAKGVGQHDNAQNFGNQSF 

EELRAACLRKGELFEDPLFPAEPSSLGFKDLG 

PNSKNVQNISWQRPICDIINNPLFIMDGISPTDI 

CQGILGDCWLLAAIGSLTTCPKLLYRWPRG 

QSFICKNYAGIFHFQIWQFGQW\nv3VVVDDRL 

PTK^DKXVFVHSTERSEFWSALLEICAYAKLS 

GSYEALSGGSTMEGLEDFTGGVAQSFQLQRP 

PQNLLRLLRKAVERSSLMGCSIEVTSDSELES 

MTDKML\nRGHAYSVTGLQDVHYRGKMETLl 

RVRNPWGRIEWNGAWSDSAREWEEVASDIQ 

MQLLHKTEDGEFW^VISYQDFLNNFTLLEICNL 

TPDTLSGDYKSYWHTTFYEGSWRTGSSAGGC 

RNHPGTFWTNPQFKISLPEGDDPEDDAEGNV 

WCTCLVALMQKNWRHARQQGAQLQTIGFV 

LYAVPKEFONIODVHLKKEFFTKY'ODHGFSFI 

FTNSREVSSQLRLPPGEYIIIPSTFEPHRBADFL 

LRVKFEKHSESWELDEVNYAEQLQEEKVSED 

DMDQDFLHLFKIVAGEGKEIGVYELQRLLNR 

MAIKFKSFKTKGFGLDACRCMINLMDKDGSG 

KLGLLEFKILWKKLKKWMDIFRECDQDHSGT 

LNSYEMRLVIEKAGIKLNNKVMQVLVARYA 

DDDLIIDFDSFISCFLRLKTMFTFFLTMDPICNT 

GH1CLSLEQVLGEGWEG1CRIAPACPSTPPPPS 
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SEQ ID 
"NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid. E=GIutamic Acid, 
F=PhenyIalanine, G=Glycine, IWIistidine, 
l=lso leucine, iv— JL-ysine. L* l-«cucuicj 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T= Threonine. V=Valine, W=Tryptophan, 
Y^Tyrosine, X=Unknown } *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














SD VPGP ASCPRLFPP WDLLP VSTV AADDH V Gl 
EAL 


103S 


238S 


A 


8621 


3 


1494 


"RSRMARAPLGVLLLLGLLGRGVGKNEELRLY 
HHLFNN YDPG SRP VREPEDTVTISLKVTLTNL 
1SLNEKEETLTTSVWIGIDWQDYRLNYSKDDF 
GGIF.TLRVPSELVWLPEIVLENNIDGQFGVAY 
DAN VL VYEGGS VTWJLPPAIYRS VCAVE VTYF 
PFDWQNCSLIFRSQTYNAEEVEFTFAVDNDG 
KTINKIDIDTEAYTENGEWAIDFCPGVIRRHH 
GGATDGPot 1 DVI YbEllKJ<J < vrJLr x virMii vrw 
LISGLVLLAYFLPAQAGGQKCTVSINVLLAQT 
VFLFL1AQKIPETSLSVPLLGRFLIFVMVVA1L1 
VMNCVIVLNVSQRTPTTHAMSPRLRHVLLEL 
LPRLLGSPPPPEAPRAASPPRRASSVGLLLRAE 
ELILKKPRSELVFEGQRHRQGTWTAAFCQSL 
GAAAPEVRCCVDAVNFVAESTRDQEATGEE 
VSDWVRMGNALDNICFWAALVLFSVGSSLIF 
LGAYFNRVPDLPYAPCIQP 


1039 


2389 


A 


8636 


1 


900 


PGRERPGGGGARRRPQHLPALLPSERPDCATL 

QAMENELPVPHTSSSACATSS 1 buAbbbbuUN 

NSSSGGSGRPTGPQISVYSGIPDRQTVQVIQQ 

ALHRQPSTAAQYLQQMYAAQQQHLMLQTA 

ALQQQHLSSAQLQSLAAVQQASLVSNRQGST 

SGSNVSAQAPAQSSSINLAASPAAAQLLNRA 

QSVNSAAASGIAQQAVLLGNTSSPALTASQA 

QMYLRAQMLIFTPTATVATVQPELGTGSPAR 

PPTPAQVQNLTLRTQQTPAAAASGPTPTQPVL 

PSLALKPTPGGSQPLPTPA 


1040 


2390 


A 


8645 


98 


1388 


ASQLAFGGKLTSTPSRDFQGCGRGAVTCCSF 

HEHRHQSGRCLSTGMAPNLKGRPRKKKPCPQ 

RRDSFSGVKDSNNNSDGKAVAKVKCEARSA 

LTKPKNNHNCKKV SNEEKPKVAIGEECRADE 

QAFLVALYKYMICERKTPIERIPYLGFKQINLW 

TMFQAAQKLGGYE1 1 1 AKKQ W Kril Y Ue,iajU 

NPGSTSAATCTRRHYERL1LPYERFIKGEEDICP 

LPPIICPRKQENSSQENENKTKVSGTKR1KHE1P 

KSKKEKENAPKPQDAAEVSSEQEKEQETLISQ 

KSIPEPLPAADMKKKIEGYQEFSAKPLASRVD 

PEKDNETDQGSNSEKVAEEAGEKGPTPPLPSA 

PLAPEKDSALVPGASKQPLTSPSALVDSKQES 

KLCCFTESPESEPQEASFPRLPHHTGHRWQTR 

MRRRMTNCPPWQITLPTAP 


1041 


2391 


A 


8646 


113 


1492 


LLQEMCTKTIPVLWGCFLLWNLYVSSSQTIYP 

GIICARITQRALDYGVQAGMKVDEQMLKEKK 

LPDLSGSESLEFLKVDYVNYNFSNIKISAFSFP 

NTSL AF VPG VG1KAT /TNHGT ANISTD WGFESP 

LFVLYNSFAEPMEKPILKNLNEMLCP11ASEVK 

ALNANLSTLE VLTKIDN YTLLDYSLIS SPEITE 

NYLDLNLKAjVF YPLblNJLlDrrr or vrr vj^i j&k. 

SNSMLYIGIAEYFFKSASFAHFTAGVFNVTLS 

TEEISNHFVQNSQGLGNVLSRIAEIYILSQPFM 

VRIMATEPPIINLQPGNFTLDIPASIMMLTQPK 

NS1VET1VSMDFVASTSVGLVILGQRLVCSLS 

LNRFRLALPESNRSNmVLRFENILSSILHFGVL 

PLANAKLQQGFPLPNPFUCFLFVNSDIEVLEGF 

LLISTDLKYETSSKQQPSFHVWEGLNLISRQW 

RGKSAP 


1042 


2392 


A 


8672 


538 


170 


ARRlAJlTRESKAAVSQDNVPALQPGKiCKKLR 
LGGKJOOCFKFFRLPKEFKJCQLMYSPSNrXKM 
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SEQ ID 


SEQ ID 


Met 


SEQ 


Predicted 


Predicted end 


Amino acid sequence (A=Alanine OCysteine, 


NO: of 


NO: of 


hod 


ID NO: 


beginning 


nucleotide 


D = Aspartic Acid. E = GJutamic Acid, 


nucl- 


peptide 




in 


nucleotide 


location 


F— Phenylalanine, G = 01ycine, H = Histidine, 


eotide 


seo- 




USSN 


location 


corresponding 


I—lsoleucine, K=Lysine. L=Leucine, 


seq- 


uence 




09/496 


conespondi 


to last amino 


M=Methionine, N=Asparagine, P=Proline J 


uence 






914 


ng to first 
amino acid 
residue of 
peptide 
sequence 


acid residue 
of peptide 
sequence 


Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine : V=Valine, W^Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














TSLAGNTVQCLNKLKYVIYSAQYPAYGN1TT 














LDMITSTDHVLEQDFWICFTFYSVKJERQI 


1043 


2393 


A 


868S 


359 


17 


GLKTRAPATPTFQREVLGPAKQDMQRRCPR1 
GLMTSLLKPIKRRWRDYKRUTCSGGFTGESC 
HHADTLGDRGGLQGDHSELLQWQKRJLRTE 
GEPSPKYISKNIFPICSYITGFL 


1044 


2394 


A 


8718 


292 


1490 


GTVKTSVATPIl'AGHSCSSGGVLQVKSPATQS 
GFKFTSKMEDFNMESDSFEDFWKGEDLSNYS 
YSSTLPPFLLDA.APCEPESLEINKYFVV1IYAL 
VFLLSLLGNSLVMLVILYSRVGRSVTDVYLL 

NI A 7 ADI T FAT TI PTWA A^K VNIHWIFfiTFI C 

KVVSLLKEVNFYSGILLLACISVDRYLA1VHA 

TRTLTQKRYLVKF1CLSIWGLSLLLALPVLLFR 

RT\TSSNVSPACYEDMGNNTANWRMLLRIL 

PQSFGFIVPLLIMLFCYGFTLRTLFKAHMGQK 

HRAMRVIFAVVLIFLLCWLPYNLVLLADTLM 

RTQV1QETCERRNHIDRAXDATEILGILHSCLN 

PLIYAFIGQKFRHGLLKILAIHGLISKDSLPKDS 

RPSFVGSSSGHTSTTL 


1045 


2395 


A 


8724 


254 


3184 


FRANLAITVANRRGAQGGKMHTCCPPVTLEQ 

DLIIRKMHSWMLQTLAFAVTSLVLSCAETIDY 

YGE) CDNACPCEEKDG ILT VSCENRGII SLSEIS 

PPRFP1YHLLLSGNLLNRLYPNEFVNYTGASIL 

HLGSNVIQDIETGAFHGLRGLRRLHLNNNKL 

ELLRDDTFLGLENLEYLQVDYNY1SVIEPNAF 

GKLHLLQVLILNDNLLSSLPNNLFRFVPLTHL 

DLRGNRLKLLPYVGLLQHMDKWELQLEEN 

PWNCSCELISLKDWLDS1SYSALVGDVVCETP 

FRLHGRDLDEVSKQELCPRRLISDYEMRPQTP 

LSTTGYLHTTPASVNSVATSSSAVYKPPLKPP 

KGTRQPNKPRV RPTSRQPSKJ3LG Y S N YGPS 1A 

YQTKSPVPLECPTACSCNLQISDLGLNVNCQE 

RKIESIAELQPKPYNPKJCMYLTENYIAVVRRT 

DLLE ATGLDLLHLGNNRI SMI QDRAFGDLTN 

LRRLYLNGNR1ERLSPELFYGLQSLQYLFLQY 

NLIREIQSGTFDPVPNLQLLFLNNNLLQAMPS 

GVFSGLTLLRLNLRSNHFTSLPVSGVLDQLKS 

LIQIDLHDNPWDCTCDIVGMKLWVEQLKVG 

VLVDEVICKAPICKFAETDMRSIKSELLCPDYS 

DVWSTPTPSSIQVPARTSAVTPA\OtLNSTGA 

PASLGAGGGASSVPLSVLILSLLLVFIMSVFVA 

AGT FV1 VMKRRKKNO^DHTSTMN^DV^^FlvJ 

/T^VJ Ijl V O V 1VI rVIVJVJCVJVl >( V^/ uJi_VXU Jul 1\1N UlV V Oul IN 

MQYSVYGGGGG TGGHPHAHVHHRGPALPK 
VKTPAGHVV'EYIPHPLGHMCKNPIYRSREGN 
SVEDYKDLHELKVTYSSNI-IHLQQQQQPPPPP 
QQPQQQPPPQLQLQPGEEERRESHHLRSP A Y S 
VSTIEPREDLLSPVODADRFYRGIl FPDKHCST 
TPAGNSLPEYPKFPCSPAAYTFSPNYDLRRPH 
QYLHPGAGDSRLREPVLYSPPSAVFVEPNRNE 
YLELKAKLNVEPDYLEVLEKQTTFSQF 


1046 


2396 


A 


8736 


28 


452 


SPSAAGGLAWVSLALGSGSRGRDHSGSGVGT 

AMAGALVRJsAADYVRSKDFRDYLMSTHFW 

GPVANWGLPIAAINDMKJCSPEIISGRMTFALC 

CYSLTFMRFAYKVQPRNWLLFACHATNEVA 

QLIQGGRLIKHEMTKTASA 


1047 


2397 


A 


8741 


673 


924 


ALPGTPQQTVTLNTDGKVKSFTSPHSNPNLPP 
AKFFTSLQSLNWSSHLPPSPATESVGKRGNAK 
PPTTKLLHSSPLWNFFAQQL 


1048 


2398 


A 


8747 


3 


5054 | 


PEVTIXPSLSQPTAASPIGSSPSPPVNGGNNAKR 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
seginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteme, 

t-\ a n nai4ii> A r*\r\ P=f* 1 1 itJllTI if* A.f*1fl 

U^Aspanic /vciu, E*=vjiuiaiiiiw ^-v^ju, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=GJutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, \V=Tryptophan ? 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 
















VAVPNGQPPSAARYMPREVPPRFRCQQDHK 

VLLKKGQPPPPSCMLLGGGAGPPPCTAPGAN 

PNNAQVTGALLQSESGTAPDSTLGGAAASNY 

ANSTWGSGASSNNGTSPNP1H1WDKVIVDGS 

DMEEWPCIASKDTESS SENTTDNNSASNPGSE 

KSTLPGSTTSNKGKGSQCQSASSGNECNLGV 

WKSDPKAK S VQS SNSTTENNNGLGN WRN V S 

GQDRIGPGSGFSNFMPNSNPSAWPALVQEGTS 

RKGALETDNSNSSAQVSTVGQTSREQQSKME 

NAGVNFVVSGREQAQIHNTDGPKNGNTNSL 

NLS SPNPMENKGMPFGMGLGNTSRSTD APSQ 

STGDRKTGSVGSWGAARGPSGTDTVSGQSNS 

GNNGNNGKEREDSWKGASVQKSTGSKNDS 

WDNNNRSTGGSWNFGPQDSNDNKWGEGNK 

MTSGVSQGEWKQPTGSDELKIGEWSGPNQPN 

SSTGAWDNQKGHPLLENQGNAQAPCWGRSS 

SSTGSEVEGQSTGSNHKAOSSDSHNSGRRSY 

RPTHPDCQAVLQTLLSRTDLDPRVLSNTGWG 

QTQTKQDTVWDTEEVPRPEGKSDKGTEGWES 

AATQTKMSGGWGDAPSQSNQMKSGWGELS 

ASTEWKDPKNTGGWNDYKNNNSSNWGGGR 

PDEKTPSSWNENPSKDQGWGGGRQPNQGWS 

SGKNGWGEEVDQTKNSNWESSASKPVSGWG 

EGGQNEIGTWGNGGNASLASKGGWEDCKRS. 

PAWNETGRQPNSWNKQHQQQQPPQQPPPPQ 

PEASGSWGGPPPPPPGNVRPSNSSWSSGPQPA 

TPKDEEPSGWEEPSPQSISRKMDIDDGTSAWG 

DPNSYNYKNVNLWDKNSQGGPAPREPNLPTP 

MTSKSASDSKSMQDGWGESDGPVTGARHPS 

WEEEEDGGVWNTTGSQGSASSHNSASWGQG 

GKKQMKCSLKGGNNDSWMNPLAKQFSNMG 

LLSQTEDNPSSKMDLSVGSLSDKKFDVDKRA 

MNLGDFNDIMRKDRSGFRPPNSKDMGTTDS 

GPYFEKGGSHGLFGNSTAQSRGLHTPVQPLN 

SSPSLRAQVPPQFISPQVSASMLKQFPNSGLSP 

GLFNVGPQLSPQQIAMLSQLPQIPQFQLACQL 

LLQQQQQQQLLQNQRKISQAVRQQQEQQLA 

RMVSALQQQQQQQQRQPGMICHSPSHPVGPK 

PHLDMMVPNALN VGLPDLQTKGPIPG YG SGF 

SSGGMDYGMVGGKEAGTESRFKQWTSMME 

GLP S VATQE ANMHKNG AI V APGKTRG G SPY 

NQFDIIPGDTLGGHTGPAGDSWLPAKSPPTNK 

IGSKSSNASWPPEFQPGVPWKGIQNIDPESDP 

YVTPGSVLGGTATSPIVDTDHQLLRDNTTGS 

NSSLNTSLPSPGAWPYSASDNSFTNVHSTSAK 

TTir\A/x^cT\i7CDnPTriT4>jPTWl ^XIKMWKNUTSS 

rrt/lKb 1 W or Ur IVjrllN Jr l Xll-»OiNX\J.vi wivnniuu 

RNTTPLPRPPPGLTNPKPSSPWSSTAPRSVRG 

WGTQDSRLASASTWSDGGSVRPSYWLVLHN 

LTPQIDGSTLRTICMQHGPLLTFHLNLTQGTA 

LIRYSTKQEAAKAQTALHMCVLGNTTILAEF 

ATDDEVSRFLAQAQPPTPAATPSAPAAGWQS 

LETGQNQSDPVGPALNLFGGSTGLGQWSSSA 

GGSSGADLAGASLWGPPNYSSSLWGVPTVED 

PHRMGSPAPLLPGDLLGGGSDSI 


1049 


2399 


A 


8748 


200 


1387 


" VPWKRQDEQLSLQVETLYLDSPAVIHLLSPTF 
LPPSSLPPFLQIVDSSSSACTLDSFFPFLAPWDS 
PQDCGFKDHQPLTLQALTVELARWTLMLLLS 
TAMYGAHAPLL ALCHVDGRVPFRPS S A VLLT 
ELTKLLLCAFSLLVGWQAWPQGPPPWRQAA 
PFALSALLYGANNNLVIYLQRYMDPSTYQVL 
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NO: of 
nucl- 
eotide 
seq- 
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SEQ ID 
NO: of 
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seq- 
uence 


Met 
hod 


SEQ 
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il-' JNU ■ 

in 

USSN 
09/496 
914 


Predicted 
uc£z liming 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
nentide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine OtJysteine, 
D— Aspartic Acid, E=GIutamic Acid, 
F-=Pheny I alanine, G=Glycine, H=Histidine, 
l=Isoleucinc, K=Lysine. L=Leucine, 
M-Mcthionine, N=Asparagine, P=Proline, 
Q^Glutamine, R=Argininc, S=Serine, 
T=Threonine, V=Val]'ne, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 

f puwiuic 1IUUCUUUC ucicuuu, \ — pubbiDic 

nucleotide insertion 














SNLKIGSTAVLYCLCLRHRLSVRQGLALLLL 

MAAGACYAAGGLQVPGNTLPSPPPAAAASP 

MPLHITPLGLLLLILYCL1SGLSSVYTELLMKR 

QRLPLALQNLFLYTFGVLLNLGLHAGGGSGP 

GLLEGFSGWAALVVLSQALNGLLMSAVMKH 

G SS JTRJLF V VS CSLV VNA VLSA VLLRLQLTAA 

FFLATLLIGLAMRLYYGSR 


1050 


2400 


A 


8758 


3 


1660 


W V S SMGFEELLEQ VGGFGPFQLRN VALL ALP 

RVLLPLHFLLPIFLAAVPAHRCALPGAPANFS 

HQDVWLEAHLPREPDGTLSSCLRFAYPQALP 

NTTLGEERQSRGELEDEPATVPCSQGWEYDH 

SEFSST1 ATESQ WDL VCEQKGLNRA A STFFFA 

G VL VG A VAFG YLSDRFGRRRLLL V A YVS TL V 

LGLASAASVSYVMFAITRTLTGSALAGFTIIV 

MPLELEWLDVEHRTVAGVLSSTFWTGGVML 

LALVGYLIRDWRWLLLAVTLPCAPG1LSLWW 

Vh'JbbAlvWLL J vuri VlvbArlKYLLriCARLNGR 

PVCEDSFSQEAVSKVAAGERVVRRPSYLDLF 

RTPRLRHISLCCVVVWFGVNFSYYGLSLDVS 

GLGLNVYQTQLLFGAVELPSKLLVYLSVRYA 

G RRLTQ AGTLLGT AL AFGTRLLVSSDMKS WS 

TVLAVMG KAFSEAAFTTAYLFTSEL YPT VLR 

QTGMGLTALVGRLGGSLAPLAALLDGVWLS 

LPKLTYGGIALLAAGTALLLPETRQAQLPET1 

QDVERKSAPTSLQEEEMPMKQVQN 


1051 


2401 


A 


8759 


515 


1625 


EIRTPVAVSSAPSGDSEGDEEETTQDEVSSHTS 

EEDGGVVKVEKELENTEQPVGGNEWEHEV 

TGNLNSDPLLELCQCPLCQLDCGSREQLIAHV 

YQril AAVVbAKb YMCrVCGRALSSPGSLGR 

HLLIHSEDQRSNCAVCGARFTSHATFNSEKLP 

EVLNMESLPTVHNEGPSSAEGKD1AFSPPVYP 

AGILLVCNNCAAYRKLLEAQTPSVRKWALRR 

QNEPLEVRLQRLERERTAKKSRRDNETPEERE 

VRRMRDREAKRLQRMQETDEQRARRLQRDR 

EAMRLKRANETPEKRQARLIREREAI-CRLKRR 

LEKMDMMLRAQFGQDPSAMAALAAEMNFF 

QLPVSGVELDSQLLGKMAFEEQNSSSLH 


1052 


2402 


A 


8763 


1106 


70 


RHGHGGRDRRGGGRVARPGGLGRYPGRGAA 
ASLVFVPTRRRSGPSGTASVAAMAYHSGYGA 
HGSICHRARAAPDPPPLFDDTSGGYSSQPGGY 

T> A TY1 A T^\/ A T7C\/XTTJTr T rtr>D\/( A \T\/ A X / A VP CCl 

1 A 1 vJ/vU V Ar o VJNnJLi^Oi^riS/jAN VAMAYGoSl 

ASHGFCDMVHKELHRFVSVSK.LKYFFAVDTA 

YVAKXLGLLVFPYTHQNWEVQYSRDAPLPP 

RQDLNAPDLYIPTMAFITYVLLAGMALGIQK 

RFSPEVLGLCASTALVWVYMEVLALLLGLYL 

ATVRSDLSTFHLLAYSGYKYVGMILSVLTGL 

LFG SDG Y YVALA WTSSALMYFI VRSLRTAA L 

GPDSMGGPVPRQRLQLYLTLGAAAFQPLI1Y 

Wl TPHI VI? 
WLlr nly V in. 


1053 


2403 


A 


8768 


2 j 


712 


RPPRVWYPELRELSAAAPRWSHRTAPGIMVF 
YFTSSSVNSSAYTIYMGKDKYENEDI FKFIGW 
PEDIWFHVDKLSSAHVYLRLHKGENIEDIPKE 
VLMDCAHLVKANSIQGCKMNNVNVVYTPW 
SNLKKTADMD VG QIGFHRQKD VKI VTVEKK 
VNEILNRLEKTKVERFPDLAAEKECRDREER 
NEKXAQIQEMKKREKEEMKXKREMDELRSY 
SSLMKVENMSSNQDGNDSDEFM 


1054 


2404 


A 


8769 


344 


527 


REATTLACRNSCWVFSRCSLGACKPTVCSMP 
SLSRQGSQTLCLRLAEYCMESVDSQRLLLS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 1 
USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine.. H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=VaIine, W=Tryp t0 P nan > 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


1055 


2405 


A 


8770 


430 


3104 


QQESP AAOAAJKJVIIn Ulv-fcvj l Uoo^vjv^i\.viiNi>»iijs. 

KMLKCVWGDGAVGKTCLLMSYANDAFPEE 

YVPTVFDHYAVTVTVGGKQHLLGLYDTAGQ 

EDYNQLRPLSYPNTDVFLICFSVVNPASYHNV 

QEEWVPELKDCMPHVPYVLIGTQIDLRDDPK 

TLARLLYMKEKPLTYEHGVKLAKAIGAQCYL 

EGHSCCSI1 


1056 


2406 


A 


8773 


261 


332 


NPRIQLSGNSCCAGSCRVWLSEQ 


1057 


2407 


A 


877S 


3 


477 


PA G JRHEQ ARG A DRM GKCRGI ,RTARKLRSH 
RRDQKWHDKQYKKAHLGTALKANPFGGAS 
HAKGIVLEKVGVEAKQPNSAIRKCVRVQLIK 
NGKKITAFVPNDGCLNFIEENDEVLVAGFGR 
KGHAVGDIPGVRFKVVKVANVSLLALYKGK 
KERPRS 


1058 


2408 


A 


8808 


171 


881 


PGLSQEPSGSMETVVIVAIGVLATIFLASFAAL 

VLVCRQRYCRPRDLLQRYDSKPIVDLIGAME 

TQSEPSELELDDVVITNPHIEAILENEDWIEDA 

SGLMSHCIAILK1CHTLTEKLVAMTMGSGAK 

MKTSASVSDIIVVAKRISPRVDDVVKSMYPPL 

DPKLLDARTTALLLSVSHLVEY 1 KNALHL l u 

GLDWlDQSLSAAEEHJLbVLKJLAAi-AoiiJriJJvo 

LPGPEGFLQEQSAI 


1059 


2409 


A 


8809 


246 


757 


MRLQGAIFVLLPHLGP1LVWLFTRDHMSGWC 

EGPRMLSWCPFYKVLLLVQTAIYSWGYASY 

LVWKDLGGGLGWPLALPLGLYAVQLTISWT 

VLVLFFTVHNPGLALLHLLLLYGLVVSTAL1 

WHPINK.LAALLLLPYLAWLTVTSALTYI-ILWR 

DSLCPVHQPQPTEKSD 


1060 


2410 


A 


8810 


304 


381 


PKLSVYPLQSHHCLSEPFQSLVCCLA 


1061 


2411 


A 


8820 


1673 


848 


SCKTENLLEMWWFQQGLSFLPSALV1W1 oAA 

FIFSYITAVTLHHIDPALPYISDTGTVAPEKCLF 

GAMLNIAAVLCIATIYVRYKQVHALSPEENVI 

IKLNKAGLVLGILSCLGLSIVANFQKTTLFAA 

HVSGAVLTFGMGSLYMFVQTILSYQMQPKIH 

GKQVFWIRLLLVIWCGVSALSMLTCSSVLHS 

GNFGTDLEQKLHWNPEDKGYVLHMITTAAE 

WSMSFSFFGFFLTYTRDFQKISLRVEANLHGL 

TLYDTAPCPIKNERTRLLSRDI 


1062 


2412 


A 


8824 


1 


763 


GGAPPASVPARESPVSvjAQubbK l KuHJsJ</v/\ 

GARAPQLCSSWQRRSAPAMSRGLQLLLLSCA 

YSLAPATPEVKVACSEDVDLPCTAPWDPQVP 

YTVSWVKLLEGGEERMETPQEDHLRGQHYH 

QKGQNGSFDAPNERPYSLKIRNTTSCNSGTYR 

CTLQDPDGQRNLSGKVILRVTGCPAQRKEET 

FKKYRAEIVLLLAL VlJr YL.IL.llr i LrvrAKLyoi 

FPDFSKAGMERAFLPVTSPNKHLGLVTPHKT 

ELV 


1063 


2413 


A 


8826 


147 


627 


CETSTSSAGHAPCRrlAAQGPPAEPTGLRLCSE 

HQRLHAWPPGPRRPSLWPPKNGKWHSGKRT 

AGGRPQRRPSRRQSQRPSAWSGSPRMHSPGQ 

KCSLMCPHRSQDSLSTAIFQRSPGANTGRALH 

CVLSKEMICSVQRSLGLSRIHLQSKRKJIHFVL 

TR 


1064 


2414 


A 


8835 


2982 


1869 


LICDTLKLSQMTQEASDEAEDMKEAMNRMIDE 
LNKQVSELSQLYKEAQAELEDYRKRKSLEDV 
TAEYn^KAEIIEKLMQLTNVSRAKAEDALSE 
MKSQY SKVLNELTQLKQLVDAQKENS VSITE 
HLQ VITTLRTAAKEMEEKI SNLKEHL ASKEVE 
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T/US01/03800 



SEQ ID 

"MfY r\f 
lNLJ . OI 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

OI 

peptide 
seq- 
uence 


Met 

hrkH 
11 ou 


SEQ 
in wrv 

in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 

i>Ct] UCI1L.C 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine (^Cysteine, 
D = Aspartic Acid, E—Glutamic Acid, 
^Phenylalanine, G=GIycine, H=Histidine, 
HIsoleucinc, K=Lysine, L=Lcucinc. 
M=Methionine, N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan 5 
Y=Tyrosme, X=Unknown. *=Stop codon, 
/=possible nucleotide deletion, \=possible 

nuwicuuuc liiDcrtiun 














VAKLEK QLLEEKAAMTDAM VPRS S YEKJLQS 

SLESEVSVLASKLKESVKEKEKVHSEVVQ1RS 

EVSQVICREKEN1QTLLKSKLEQEVNELLQKFQ 

QAQEELAEMKRYSESSSKLEEDKDKKINEMS 

ICEVTKLKEALNSLSQLSYSTSSSKRQSQQLEA 

LQQQVKQLQNQLAECKKQHQEV1SVYRMHL 

7 YAvnfinMnpnvn^vT v r^n T\AC*\£>dc\<z.r\\t 

L, l r\v ^UvMJJdL' v v^rv VLrlVV^lL. 1 JVl^K.iNV^ov ( >K 

IC 


1065 


2415 


A 


SS41 


3 


663 


AAATAASLSPRGCRLRTPS SD VGPSRAPPPSA 

APLPTGRAQMSPSGRLCLLTIVGLILPTRGQTL 

ICDTTSSSSADATIMDIQVPTRAPDAVYTELQP 

TSPTPTWPADETPQPQTQTQQLEGTDGPLYT 

DPETHKSTKAAPIPTDDTTTLSERPSPSTDVQT 

DPQTLICPSGFHEDDPFFYDEHTLRKRGLLVA 

AVLFITGIIILTSGKCRQLSRLCRNHCR 


1066 


2416 


A 


8853 


3806 


2204 


FVGEQEG G CE AG AGRGAQT YPGE AGER \VFG 

RRRRRGRVVSRKKMSLKSERRGIHVDQSDLL 

CKKGCGYYGNPAWQGFCSKCWREEYHKAR 

QKQ1QEDWELAERLQREEEEAFASSQSSQGA 

QSLTFSKFEEKICTNEKTRKVTTVKKFFSASSR 

VGSKKEIQEAKAPSPSINRQTS1ETDRVSKEFIE 

FLKTFIIKTGQEIYKQTKLFLEGMHYKRDLSIE 

EQSECAQDFYHNVAERMQTRGKVPPERVEK1 

MUQIhK. 1 1M 1 K1.YKY VFCPETTDDEKIvDLAI 

QKRIRAJLR\WTPQMLCVPVNEDIPEVSDMVV 

a TTTMTTT K vfT\ C L' D \/DD ni'T A r*VTV PCL'UI rrx T a T 

r^llJUllliivLU^KK VrKLJlsJLACl i KUoKrilr NA1 
T^TTk"MFPA<?Ar>nPT PT1 FVTVT k'^MPPPT r\QXIT 

OYFTRFPTsIP^RT MTriPDr.YVPTMT rrAVAPIP 
\^ ill ivr vini oi\ t jivx x \J illjkj i i r j in juv_-v^/-\ v /w iii 

KLDAQSLNLSQEDFDRYMSGQTSPRKQEAES 

WSPDACLGVKQMYKNLDLLSQLNERQERIM 

NEAKKLEKDLEDWTDGIAREVQDIVEKYPLE1 

KPPNQPL AAIDSEN VENDKLPPPLQPQ V YAG 


1067 


2417 


A 


8855 


1372 


1513 


SNMREVGCGWLVPVIPAFWEAEVGGSLEARS 
LRQAWATKQDPISKKK 


1068 


2418 


A 


8856 


1530 


1583 


PCRPGMECNSMISVHCNL 


1069 


2419 


A 


8857 


1530 


1583 


PCRPGMECNSMISVHCNL 


1070 


2420 


A 


8866 


293 


1675 


PYPQGGYPQGPYPQEGYPQGPYPQGGYPQGP 

YPQSPFPPNPYGQPQVFPGQDPDSPQHGNYQ 

EEGPPSYYDNQDFPATNWDDKSIRQAFIRKVF 

LVLTLQLSVTLSTVSVFTFVAEVKGFVRENV 

WTYYVSYAVFF1SLIVLSCCGDFRRKHPWNL 

VALSVLTASLSYMVGMIASFYNTEAVIMAVG 

11 1 AVUr 1 v virbiVl^l KYUr J buMOVlJLVolvl 

VVLFIFAILCIFIRNRILEIVYASLGALLFTCFLA 

VDTQLLLGNKQLSLSPEEYVFAALNLYTDIINI 

FI,YILTnGRAKE*PSSSSLCPLRWHGWPGPCP 

U/TJnC A QPT5PT Q PDA A f\L>]> U"V r\ A <J 1 / AIJOr^N /tV 

TADTSIWTRCGHSMAPLVLPPPPRGTKATFPC 
HLLSTHCCMSPVCQPTPGTGGSTRSRGEGLSQ 
EVRVHVFPPVPAPQPGVEHPSPPPHPPGVLPS 
GDMRSGGLIPVLSPE 


1071 


2421 


A 


8868 


2 


358 


ARGNTLYHLPRLCRKLNLRWFSASTLYDVQH 
DDKMGSNTFFKRNDCRYVN'nSCKADMAYDN 
VRHPFMI*SI\KLIMEETYLNIIKAVYDRPTASI1 
LNGEKLKVFPVRSGT*QGCSVWP 


1072 


2422 


A 


8870 


33 


658 


MESVLSKYEDQITIFTDYLEEYPDTDELVWIL 
GKQHLLKTEKSKXLSDISARLWFTYRRKFSPI 
GGTGPSSDAGWGCMLRCGQMMLAQALICRH 
LGRDWSWEKQKEQPKEYQR1LQCFLDRKDC 
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SEQ ID J 
NO: of 1 
nucl- 1 
eotide 5 
seq- i 
uence 

l 


SEQ ID 1 
MO: of 
peptide 
>eq- 
jencc 


Viet : 
nod 


SEQ 1 3 
[D NO: 1 
n i 
USSN 
09/496 
914 


Predicted I 1 
beginning i 
nucleotide 
ocation 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
lucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 

l—\ A onartin A f*\A P — Oil ltfUTH O Acifl 

J=A.SpaniC /\C1Q, Cj — VJlULOJUiV/ n^m, 

F=Phenylalanine, G=Glycinc, H=Histidine, 
I=lsoleucine, K-Lysinc, L=Leucine, 

Tv/f*»thinnint» "N=A«;nara{?ine P— Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=V aline, W=Tryptophan, 
v—Tvm<=inp X=Unknown. *=Stop codon. 
/=possible nucleotide deletion. \=possible 
nucleotide insertion 














CYSIHQMAQMGVGEGKSIGEWVLGPNTVWVg 
G V * KNLA\LFDEW\NSLGL V Y VSMVDNPSGSl A 
RFPKKLCRVLPLNSADTAGLTGP 


1073 


2423 


A 


8879 


146 


412 


DFSV*GDVDIEVTCPICLQLLTEPLSLNCGLRL 
*QVCITA*IKESVIISGG*SSSPVCHTTFQPANL 
RTSRYLPT* SIKSLGPDEPQEG 


1074 


2424 


A 


8884 


67 


435 


HLQGRSIRTLQLTGENEKNCEVSERIRRSGPW 
KEISFGDYICHTFQGDCWADRSPLHEAAAHG 
RLL ALKTLI AQGVNVNLWTL/DRV S SLHEACL 
* GP V AC AJCPY WKM VPRHG GT VTGPPLLM V 


1075 


2425 


A 


8896 


1294 


248 


RSGDRNGLTHQLGGLSQGSRNQSYRSRSRSR 

SRERPSAFRGIPFASASSSVYYGSYSRPYGSDK 

PWPSLLDKEREESLRQKRLSERERIGELGAPE 

\ rvi ;r-T ooi^MPFT>nc;nPTTrP\^r>FEPICKSTTSAS 
VWuLor JxNrfci UoJJt'n i r \ £,ul^iz,l ivjn.o 1 x ur\^> 

TSEEEKiQCKSSRSKJERSKKRRKKXSSKKICHK 
KYSEDSDSDSDSETDSSDEDNICRRAKKAKKK 
EKKKKHRSKKYKXJCRSKKSRKESSDSSSKES 
QEEFLENPWKDRTKAEEPSDLIGPEAPKTLTS 
QDDKPLNYGHALLPGEGAAMAEYVKAGKRI 
PRRGEIGLTR*RNCHHLNAQVM* * WSRHRR 
\a~e a \tt>t AifR ppp*nTVT MRREPLHPFNPRRET 

KERE 


1076 


2426 


A 


8899 


146 


789 


GRSTEAEKJEPAFDERTGKGRRLPRAGEFHG* E 
* APGPGPRSFQVSRKMPEEVPPGARKHPFSGKS 
T?vT ni DArvwi OPT TGAIOCTLGGVIEGFLSKE 
VSYIVSSRREVKAESSGKSHRGCPSPSPSEVR 
VETSAMVDPKGSHPRPSR1CPVDSVPLSRGKE 
t T ott ATT?-MOK:**rTVOOLSHCRLY\GEKTTAK 
RSOREHVQQQSQEHGKWPDLKGPR 


1077 


2427 


A 


8901 


352 


3 


AKIGAYKYIQELWRKKQSDVMHFLLRVRCW 
ovpat HRAGTEWOLSALHRAPRSTQPDKAC 
RLGYKAKQGYIIYRICVRRGGWKCPVPKAVT 
\YGKPVHHGVN*LKFAQSLQSVAEEQ 


1078 


2428 


A 


8905 


536 


781 


"ACPAENREVPEMAAGQAPHAGPGAGPGQPA 
PALPFAATPGSRGQALCRGGRRRQHLHGPLH 
RP* OAAPALHAGCQLAPHPPT 


1079 


2429 


A 


8912 


121 


376 - 


NLIWKLCVTERRLVILDNYDLASEA'EANKYI 
CNRIIQFKPGQDKYFTLGLPTGSTPL*CYPKLI 
EYNKNGHLiSFKYVKTFSMDEY 


1080 


2430 


A 


8920 


381 


1788 


" SSESPSDPGRMAMTW1VFSLWPLTVFMGHIG 
GHSLF SCEPITLRMCQDLP YNTTFMPNLLNH Y 
DQQTAALAMEPFHPMVNLDCSRDFRPFLCAL 
Y APICME YGRVTLPCRRLCQRAYSECSKLME 
MFGVPWPEDMECSRFPDCDEPYPRLVDLNLA 
GEPTEG APV AVQRD YGF WCPRELK1DPDLGY 
otri tj\rDr»r , QPPr , PMMYFRRFFLSFARYFIGLlS 
nCLSATLFTFVTFLIDVTRFRYPERPIKCYAV 
WHMMVSLIFF\IGFLLEDRVACNA\SIPAQYKA 
STVTQGSHNKACTMLFMILYFFTMAGSVWW 
VILTITWFLAAVPKWGSEAIEKKALLFHASA 
WG1PGTLTIILLAMNK1EGDNISGVCFVGLYD 
VDALRYFVLAPLCLYVVVGVSLLLAGIISLNR 
VRIEIPL*KENQDKLVKFMIRIGVFSILYLVPLL 
VVIGCYFYEQAYRGIWETTWIQERC 


1081 
10 j2 


2431 
2432 


A 
A 


8922 
1 8923 


56 
355 


420 
1079 


EERTKM STGPD VK AT VGDI SSDGKLN V AQEE 
CSRKGIVDEFFPLLSN* CIWTQPQGYPQSSYG 
TLANFVFVCSVRHGLALILQLCNFSIYTQQMN 
LSIA1P AMVNKTAPPSQPN ASTERP ST 
PFGTPSSTMAVVK^CLMKGGKKGVKKKVV 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E-Glutamie Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
l=lsoIeucine, K==Lysine. L=Leucine, 
M=Methionine, N=Asparagine, P-Proline, 
Q=Glutarnine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, ^possible 
nucleotide insertion 














GPFSKKDQYDVKAPAMFNIRNTGK/TLVART 

QGTQIASDGLKGLLFEVSLADLQNDEVAFRK 

FKLITEDVQDKNCLTNFYGMDLTCDKICSMV 

EKWSTMIEAHVDVKTTDGYFFHLFCVGFTKK 

HNNQILKTS YA*HQQS/RQIQKKMME1MT* EV 

QTNDLKEVVNICLIPDNIGICDTEKV/CPIYPLH 

DVFIRKVKMLENPGFER\N4ELRGGGSSS 


1083 


2433 


A 


8948 


28 


385 


LTWPQPHIPSCPAMSEETLQSKLAAAK1CKLP 
WGAVQGSRAMSDLLLLLLDLTLLLLLMLLGF 
AGYSGQLAGVAVSAGSPPI/RYKFHVEPYGET 
GWLLT/ESCSISPKLCSIAVH*DNPAWF 


1084 


2434 


A 


8950 


156 


31S 


HYTP1NTDTIENSENNKCW*GY*E\VGUHHW 
WGGKRVQPFWICRVWQKRTLNLRV 


1085 


2435 


A 


8956 


16 


413 


HMGQLG YFIQC W WECKRLI SF\ WKTI* QSPAK 
*TIYTSYDTA1PIS/GI/YPKRMSSKCHQETCAR 
MFILAPFTATIKGKQLTCPLVEERIDYVMWYS 
HKY YIKVKRNL* VTITHYTW VNLNILMFEIIL W 
YSHKYY 


1086 


2436 


A 


8962 


868 


1026 


H* KILQ VGRAQRAHX SRL* SQLLRRLRHESHL 
NPGARGCSEARLHRCTPAWTT 


1087 


2437 


A 


8985 


58 


330 


LHVKHLGHFQLVFSEVICHCILMPVS*ELQRL 
* ERS VC AFH VCIQTY VCLQ V Y ACMC V Y YICM 
FVYSVYGCGLCTCVCMDVYICVCVQEFL 


1088 


2438 


A 


8989 


394 


404 


N*KWILHVNVRJQS1FF/IICRNQK/1NSHELKLD 
KKFLDMM SNA* STKKHDKLD/LIKFKT/LC S A 
IC YT VKRIKIHPTDLEKMLRNHL SDKD * YS/G V 
YKDLSKLNRRKTE/S* /VKXWVKDLSRYFIKE 
VI SMENKHKKIFSTS 


1089 


2439 


A 


8991 


60 


329 


MALTPESPSSFPGLAATGSSVPEPPGGPNATL 
NSSWDSPTEPSSLEDLEATGTIGTLLSDMGVV 
GVEDNAYTLEVNSRYMRAVGIM*IHL 


1090 


2440 


A 


8996 


2 


351 


SNITITLT*MKKYDNTFCW*GCGQIG/T/LIYC 
WQESKFIQAFWSKIQQYLA*ISIHILFDPAFLFL 
G G YPGGTQS VFLTG VL VSS VP YNMKMLHTR 
LLIAALFIIVQYWKQSKDHYI 


1091 


2441 


A 


8997 


97 


456 


YPLPVCSYLSGPRGEHWNSLGGKSSCPLPLPT 
LVSSRFKISKVIVVGDLSVGKTCLINR*GGAG 
AELGRVGPSLARWAGSRSQHLVPSQWCKDS 
FDKNYKAPIGADFEMERFEVLGIPF 


1092 


2442 


A 


8999 


548 


811 


SSFIKRHILIFEDDWHQTTCCHHPHHP\F*RCQ 
FHIFYVSVQNSISPSLSVSSSHPDRPDHEVHQH 
RAAI iHHQHG QGPLGHGL V AR VG 


1093 


2443 


A 


9002 


3 


2745 


ALLGLQQPAQSLILSRSSVMGVRGLQGFVGS 

TCPHICTVVNFKELAEHHRSKYPGCTPTIWD 

AMCCLRYWYTPESWICGGQWREYFSALRDF 

VKTFTAAGIKL IFFFDGM VEQDKRDE WVKRR 

LKNNREISRIFHYlXSI-nCEQPGRNMFFIPSGLA 

VFTRFALKTLGQETLCSLQEADYEVASYGLQ 

HNCLG1LGEDTDYLIYDTCPYFSISELCLESLD 

TVMLCREKLCESLGLCVADLPLLACLLGNDII 

PEGMFESFRYKCLSSYTSVKENFDKKGNIILA 

VSDHlSKVLYLYQGEKKLEErLPL/VTKQSSFL 

*RNGIISFTRT/INLHGFSKNPKV**LWTNK*YP 

RVQTPNPGKKFPCVQMLNPGKKFPCVQALNP 

GEKFPCIHI/PEPRQEVPTCSDPEPRQEVPTCTG 

PESRREVPMCSDPEPRQEVPMCTGPEPRQEVP 

MCTGPEARQEVPMCTDSEPRQEVPMCTDSEP 

RQEVPMYTGSEPRQEVPMYTGPESRQEVPMY 

TGPESRQEVLIRTDPESRQEIMCTGHESKQEV 
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NO: of 
nucl- 
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seq- 
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SEQ ID 
NO: of 
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seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
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914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D^Aspartic Acid, E=GIutamic Acid, 
F=Pheny!aIanine, G^GIycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
MNMethionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon. 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














PICTDPISKQEDSMCTHAEINQKLPVATDFEFK 

LEALMCTNPEIKQEDPTNVGPEVKQQVTMVS 

DTEILKVARTHHVQAESYLVYN1MSSGEIECS 

NTLEDELDQALPSQAFIYRPIRQRVYSLLLED 

CQDVTSTCLAVKEWFVYPGNPLRHPDLVRPL 

QMT1PGGTPSLK1LWLNQEPEIQVRRLDTLLA 

CFNLSSSREELQAVESPFQALCCLLIYLFVQV 

DTLCLEDL1 IAFIAQALCLQGKSTSQL VNLQP 

DYINPRAVQLGSLLVRGLTTLVLVNSACGFP 

WKTSDFMPWNVFDGKLFHQKYLQSEKGYA 

VEVL/CRTK*ISAHQIPQPEGSRLQGLHEGEQT 

HHWPSPLGLTPRREVGKTGLQLPQDGLWV 


1094 


2444 


A 


9021 


97 


834 


AREACRAKTDFPGRRFRLWPSCCCRVIVGAE 

T* HVMAEPVSPLKHF VL AKKAITAIFDQLLEF V 

TEGSHFVEATYKNPELDRIATEDDLVEMQGY 

KDKLSIIGEVLSRRHMKVAFFGRTSSGKSSVI 

NAMLWDKVLPSGIGH1TNCFLSVEGTDGDKA 

YLMTEGSDEKKSVKTVNQLAHALHMDKDLK 

AGCLVRVFWPKAICCALLRDDLVLVDGPGTD 

VTTELDSWIDKFCTKSSTREITNSGSDT 


1095 


2445 


A 


9022 


1 


537 


LVLNSRVEDFVPPEGAGRTLPFALRPLAACW 
LLHRRARJR-SSALCPRPRSWGVSGGEGAGARE 
P*ITSSSCCLSAA/SHLSIQSPNMAGARRRIRPQ 
LAKEKIEGCHICTSVTPGEPQVFLGKDKAFTF 
DYVFDIDSQQEQIYIQCIEKJLIEGCFEGYNATV 
FAYGQT\GAGKTYTMGTGFD 


1096 


2446 


A 


9029 


1 


285 


FFFFNVCKSPKVPKPGCKEESTGTLFKNTLISL 
GQHSETPSLKKK\LAGYSGMCL*SQVLRRLRQ 
EDCLSPGGGNCRES* SCPYTPAWITERDPV 


1097 


2447 


A 


9032 


716 


357 


ARSTGFWGEILWCGFLKRSLALSPRVKCSGAI 
LAHCNFRHAGFPPLSCLSLPNRWEYRRPPARP 
GKFFLVFLVETGFQC/G*DGLDLLTSRSACLG 
LPKCWDYRREPAASnFQTTFFINSK 


1098 


2448 


A 


9038 


230 


652 


KVVVMSCEDINISGSFYRNKLKYLAFLCKRTS 
TNPSQGP YHL WVPSHIF WQTTCGRLPHK i KQ 
G* AALDHLKVFDRIPLPYDKKKQMAVS ATLE 
V VRPKP* RKF AYLG H\V AQKVD WKYQAMT A 
TMGEKRKVYYQKICYQKJC 


1099 


2449 


A 


9043 


185 


372 


I1FYSHQQCMRV/WQGCGDIETLIHCW*E*KII 
HSL/WK7TV* QFLKRLYLHLPHNS VIAFLGISP 
RKIKTCPQNSCTSMLINAIHNDQKWKKINI 


1100 


2450 


A 


9045 


763 


584 


RQSLALSPRLECSGTISAHCRLCPLVFTPLSCL 
SLTSSWDYRRPPPHPANFLYFK*RRGF 


1101 


2451 


A 


9050 


275 


2 


LFFLRKVSNQFLSPSLLPVNFQGFVFAFLLLLL 
FLL/FEMESLP VA/RVECSGTISAHCNLCLPG SS 
DSPASAS*VAGITDMCRYTQLILFHAS 


1102 


2452 


A 


9053 


449 


1224 


KTSMFWKFDLHSSSHIDTLLEREDVTLKELM 

DEEDVLQECKAQNRKLIEFLLKAECLEDLVSF 

IV+EEPPQDMDEKJRYKYPNISCELLTSDVSQM 

NDRLGEDESLLMKLYSFLLNDSPLNPLLASFF 

<?ir\/T <m I^RKPFOTVOFLKICKHDFVDLIIKHIG 

TSAIMDLLLRLLTCffiPPQPRQDVLN/WFKVQ 

RNL*HST*NVMDISKYVNLHWGLNKSHSLL* 

LLLQCVLQWLNEEKHQRLVEIVHPSQEEDVS 

SLV 


1103 


2453 

1 


A 


9058 


403 


3 


GLHV YDFQ V YREHILTLN VKKCS VSF WGLRE 
WLYLQM YEHKSPRFPnKMTDITKC W* GQGA 
AGMQI/H/C W\WC VN VGKF WEMS * YYLLKLSI 
ST/PYDPAIPLLGIYL* ETRV YIHPFCTCMRMLIA 
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SEQ ID 

VIA. 

NU. oi 
nucl- 
eotide 
seq- 
uence 


SEQID 
NU. Ot 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine s 
D=Aspartic Acid, E=G!ulamic Acid, 
F=Pnenylalanine > G=Glycine, H=Histidine, 
l=Iso leucine, K=Lysine, L^Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














APFVLAVNC 


1 1 CiA 


OA^A 


A 
/A 




/ J 




KWLrbo.LJNi 1 OKU UllOrlLKWLDCKXNCbSr PI 
KRNRQTHSTESNKLKAGH SFG YN * LIH* N S\ V 
K 1 ULULUAN blvU V V V VMKV\K I AQQivQTTS 
YMQIGTTKN SRAT 


1105 


2455 


A 


9065 


366 


778 


DLLILRNLAFPELKRJINCISRFYLAYHLHKIYS 
RSILLCNNC S G F YILSL* Q YD VFFFNYFFFRDR 
AWCCPGWSAAWLTIVILAHYRRPGLERSCC 
LSLSSSWDHRRVPPCPANF*/YFSMGFTAFPRL 
VLNS*TQGI 


1106 


2456 


A 


9083 


673 


816 


ESGSLIH* WWENKPAQPLWWEI*QH VQKLPT 
HFPCDPAIPLLG ICPED 


1107 


2457 


A 


9086 


580 


18 


KPSSGSFIRAIYIFLSTAHVPALFSVLVRTKLT* 

AFSQSSVLWAHKQQKTSLSLVIR/ERLQIKTA 

VRENFLPIRLAKJLKLDNVKCWQG/SGSNMSL 

I/HCWWEYNVIHnVN^SVTFPRKVEHVYITYA 

PEISVR*IHGGLPTLVHQETHTSVFRGAPSVIP 

ETR\CRPTKESINkLLHI YTMEHYGD ENK 


1108 


2458 


A 


9093 


540 


1 


GGNDCSVTPTTEPGRKEIT*1CRK.F*EKTDRLP 

GA/PPSRTPPTPYPCPHGDRLLPPSRPLPAGPA 

SAFPPAERSRGHRRASL*RARWSAAVPRRSA 

GSASEPVQSRWLRLPVGSDSPPAVPVRVCPAP 

Do \<r AAr O bKJLl 'JJ ryj L.D b r APb R TPS b b V D *GG 

QRPPPPSGDSLSPPGCCRY 


1309 


2459 


A 


9099 


1255 


1425 


HESYHVNPNLCNPVAPTSGAHSIG*KWPSVVL 
GAVAHSCNPSTJLVGRGGR1TRGQELR 


1110 


2460 


A 


9103 


242 


70 


EEQFFFFAVGMFP*VDFLAPASGELWDRLRLT 
CSRPFTRHQSFGLAFLRVCSSLDSLDDSVVGP 
SALLSSVIVNQGGRNVLEAREAAKHPTI*RQS 
LLRKQRNKRMAIP 


1111 


2461 


A 


9110 


189 


121 


SFLS VRLECN G A1MAHC ALPLPG 


1112 


2462 


A 


91 13 


100 


910 


RRRGG GS RPRRTP VPAPGPGPSFGMD VRF YP 

AAAGDPASLDFAQCLGYYGYSKFGNNNNYM 

NMAEANNAFFAASEQTFHTPSLGDEEFEIPPIT 

PPPESDPALGMPDVLLPFQALSDPLPSQGSEFT 

PQFPPQSLDLPSITISRNLVEQDGVLHSSGLHM 

DQSH1 QVSQYRQDPSLIMR\PSST*PDAARSG 

VMPPAQLTTINQSQLSAQLGLNLGGASMPHT 

SPSPPASKS ATPSPS S S INEED ADEANRA IG EK 

RAAPDSGKKPKTPKK 


1113 


2463 


A 


9120 


3452 


3051 


FLRPSF AL VPQ AG VQWCALS WLQPPSPRFK* F 

SCLSLPSSWDYRHVPPRPANFFVLLVETGFLH 

VGQAGHEPLTSGDPPASASQSAGITGVSHQA 

WPSFFIFSRDT\ 7 LLCCSGWSRTSGLKQSACLS 

LLKCWUY 


1114 


2464 


A 


9122 


152 


377 


NQLPLQQWTFFIYETGFCSVAQAGYQCRDHS 

SLHP*PPG\SSDPPAPPS*VJLGITGQRYHACLII 

YLYVQTVTQRV 


1115 


2465 


A 


9124 


553 


981 


QRPLLRQQLGSWPTCRSLEGDLASP\V**RLPG 
SPRMRRSGT/ATLNLPLSPQGTVRTAVEFQVM 

WRLQHKGRGRGDLHJLPDHHLSVPSSADHPA 
QQPSQFNGRNLYFLPLFR 


1116 


2466 


A 


9135 


48 . 


410 

1 


SASIIEPAEHDGGADSLSASQPPRPAGRPAGA 

QHVHVPPWTDVLAGQDRRAPTAGDGAPWP 

APGGHVPSTRPHDPAEFHADEAAGRGGRGLQ 

PAAPHALPAGLPHGPPAPA/PAEGGGTP*GSA 

GAGGP*GSPAGRACGAAGCRPRPPRPAASSA 

*NSAGS*GLVEGT*PPGAGHGAPSPAVGARLS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartie Acid, E=Glutamic Acid, 
F=Phenyiaianine, G^Glycine, H=Histidine, 
I=Isoieucine 3 K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q-=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V= Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














CPARTSVQGGTWTC*APAGRPAGLGGWEAE 
RESAPPSCSAGS*DAD*GAEPWGAGSRS\VGS 


1117 


2467 


A 


9141 


380 


939 


KSGHWAKECLQPRIPPRPCPICVGPHWKSDCP 

TCPGAVPRAPGTLPQGSLTDSFPDLLSLVAED 

* CCLMASEAS WTITVELWVTLTVEGKSVP/CL 

NTEATHSTLPSFQGPVSLASITWGIDGQASKP 

LKTPQLWCQLGQYSFMHYFLVIPTCPVPLLG* 

GILTKLSAFLTIPRLQPHLIAALSPSS 


1118 


2468 


A 


9154 


471 


2 


AAGQVVVEVTSHLYLCITSDAAGLRLLPPAES 
ERG EGGHCPAE APLPPRPQ YCL AKHPLLRKLP 
EEKIKLDPYLTQHTKINSKQIKYLS/VRARTTQ 
LVEGNIGVNLQNTELKQII*INGFLDTTPEAQE 
TKEKTNKLNFn<KVKRQLAEWEKIFQIA 


1119 


2469 


A 


9155 


2 


3187 


ACPRLARRRRRVRSLRRRRGWLRARWSRGQ 

NNMAARRJTQETFDAVLQEKAKRYHMDASG 

EAVSETLQFKAQDLLRAVPRSRAEMYDDVHS 

DGRYSLSGSVAHSRDAGRESLRSDVFSGPSFR 

SSNPSISDDSYFRKECGRDLEFSHSNSRDQVIG 

HRICLGHFRSQDWKFALRGSWEQDFGHPVSQ 

ESSWSQEYSFGPSAVLGDFGSSRLIEKECLEK 

ESRDYDVDHPGEADSV/LRGGSQVQARGRAL 

NIVDQEGSLLGKGETQGLLTAKGGVGKLVTL 

RNVSTKKIPTVNRITPKTQGTNQIQKNTPSPD 

VTLGTNPGTED1QFPIQKIPLGLDLKNLRLPRR 

KMSFDIIDKSDVFSRFGIEIIKWAGFHTIKDDIK 

FSQLFQTLFELETETCAKMLASFKCSLKPEHR 

DFCFFTIKFLKHSALKTPRVDNEFLNMLLDKG 

AVKTKNCFFEIIKPFDKYIMRLQDRLLKSVTP 

LLMACNAYELSVKMKTLSNPLDLALALETTN 

S LCRKSLALLGQTF SL AS SFRQEKIL* A VGLQ 

DIAPSPAAFPNFEDSTLFGREYIDHLKAWLVS 

SGCPLQVKKAEPEPMREEEKMIPPTKPE1Q.AK 

APSSLSDAVPQRADHRWGTIDQLVKRVIEGS 

L SPKERTLLKEDPAY WFLSDENSLEYKY YKL 

KLAEMQRMSENLRGADQICPTSADCAVRAML 

YSRAVRNLKKKLLP\WQRRGLLRAQG\LRG\ 

WKARRA\TTGTQTLLFLRAPGLKHHGRQAPG 

LSQAKPSLPDRNDAAKDCPPDPVGPSPQDPSL 

EASGPSPKPAGVDISEAPQTSSPCPSADIDMKT 

METAJiKLARFVAQVGPEIEQFSIENSTDNPDL 

WFLHDQNSSAFKFYRKKVFELCPSICFTSSPH 

NLHTGGGDTTGSQESPVDLMEGEAEFEDEPP 

PREAELESPEVMPEEEDEDDEDGGEEAPAPG 

GAGKSEGSTPADGLPGEAAEDDLAGAPALSQ 

ASSGTCFPRKRI S SKSLKVGMIPAPKRVCL1QE 

PKGECPPVGTVASSTVLGW WAVRVRRDRWR 

HFNPKEFCAPLQNVSRHSCFPVV 


1120 


2470 


A 


9163 


124 


207 


PPRACRPCPRACPCPPT*KCSQPVSWPC 


1121 


2471 


A 


9166 


272 


523 


PMSSLQGCFYTFKCIIFKGIFLLTJSNLIAF**EK 
V/CSF1ITDSLICFIGKGWVGMVTHACNPGTLG 

VJ VJVJf W LrV V iVIZ/i Hi I OL^yJi^xVl 


1122 


2472 


C 


9170 


442 


236 


MNRRRFLRPADCHSGMRGTENGACSEGESQI 
HCGAOGEGVQLVHVVNQPENGCLQFDSTHIT 
FSKRQN* 


1123 


2473 


A 


9171 


10 


423 


' MVDRSPLLTSV11FYLAIGAAIFEVLEEPHWKE 
AKKNYYTQKLHLLKEFPCLGQEGLDIOLEW 
SDAAGQGVAITGNQTFNNWNWPNAMIFAAT 
VITTIGYGNVASKTPGGRLFCGFYGLFGVPFC 
LTWINALGKFFG 



307 



NiSDOCID- <WO 0157188A2_I_> 



WO 01/57188 




T/USO 1/03800 



SEQ ID 

MU. Ot 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

ri\J. 01 

peptide 
seq- 
uence 


Met 
noa 


SEQ 
ID NU- 

in 

USSN 
09/496 

Q1 A 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid. E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
l=lsoieucine ? K-Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine. R=Argintne, S=Serine, 
T=Threoninc : V=Valine, W=Tryptophan. 
Y=Tyrosine, X=Unknown, —Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 


1124 


2474 


A 


9173 


3 


374 


GPSPSLLVLLPQEPGGTGTPVRAGAGAGMWL 
YV tUQOOLrLOi'r br LMi^MLLLbTRNPVN ACLL 
I ubLr VLLU Vrbrtr VJ^bCKAbQbLrLrKDKISA 
1AHRGGRHDPPENTLGATR/QGS**WSNRR 


1125 


2475 


A 


9179 


704 


188 


ESSSGLLFQCFQGIHVQKLTLQARPTLFSWWL 
CSKPPKETGELENAESGGDGGRRGGKQDNV 
A W WKl<JVi\<^K\jr\lJ.r rWUUbUr 1 QSOPFGGQA 
LPMGFFYLYFRDPGREITWKHFVQYYLARGL 
VDRLE V VNKQS VR VIPAPGTS SE VRGEFK AE 
YCRHKFISCKNVVFYFFQ 


1126 


2476 


A 


9183 


153 


233 


MEYMAESTDRSPGH1LCCECGVPISPN 


1127 


2477 


A 


9185 


1 


321 


LTGQLG SILLR VFSKSRAGLG ARKLKA YRTM 
EYMAESTDRSPGfflLCCECGVPISPNPAQY\CV 
ACLRSSFHIYHCIPKLFIHPFSKTSSSAFITPSHY 
LTFFSTIS 


1128 


2478 


A 


9186 


183 


847 


VLKFLLLQTMDEQSQGMQGPPVPQFQPQKAJL 
RPDMGYNTLANFRIEKJC1GRGQ\FSEVYRAAC 
L\LDGVPVALKKVQIFDLMDAKARADCIKEID 
LLKQLNHPNVIKYYASFIEDNELNIVLELADA 
GDLSRM1KHFKKQKRLIPERTVWKYFVQLCS ; 
ALEHMH SRRVMHRDIKPANVFITATGVVKLG 
DLGLGRFFSSKITAAHSLVGTPYYMSPERIHD 
NO 


1129 


2479 


A 


9190 


1 


370 


GTSVVKIPSAAVSESSPNGAAYASGLPCGVRG 
PPWAGLALLPSPTLMAIXRRPTVSSDLDNIDT 
RATT\KIRVVAT1TRARIEDMRFISATALTRPD 
ATTAQIPKLPVTTVCNRRANPGIPPSVL 


1 1 ic\ 




A 

A 




131 


487 


AYLKR1.PVPESITGFARLTVSEWLRLLPFLGV 
LALLGYLAVRPFLPKKKQQ1CDSLINLKIQKEN 
PKVVNEINIEDLCLTKAAYCRCWRSKTFPAC 
DGSHNK HNELTGDNVGPLILKKKE 


1131 


2481 


A 


9201 


184 


605 


KELVDEKSERGRAIVIDPVSQLASAGTFRVLKE 
PLAFLRALELLFAIFAFATCGGYSGGLRLSVD 
CVNICTESNLSIDIAFAYPFRLHQVTFEGVPTCE 
GKERHKLALIGDSSSSAEFFGTVAGFAFLYSL 
AATG V YIFFQN K Y 


1132 


2482 


A 


9206 


1 


852 


GGGRAGAGSRJ3MGSTDSKLNFRICAVIQLTTK 

TQPVEATDDAFWDQFWADTATSVQDVFALV 

PAAEIRAVREESPSNLATLCYKA\rEKLVQGA 

ESGCHSEKEKQIVLNCSRLLTRVLPYIFEDPD 

WRGFFWSTVPGAGRGGQGEEDDEHARPLAE 

S LLLAI A D LLFCPDFT VQ SHRRST VD S AED VH 

SLDSCEYIWEAGVGFAHSPQPNYIHDMNRME 

LLKLLLTCFSEAMYLPPAPESWQH/RTHWFSS 

FVSSENRHALPLFTSLLNTVCAYDPVEYGIPY 

NHLY 


1133 


2483 


A 


9208 


1165 


1463 


GPRARVQGFSGADIVKFMALGSMYLX^LTLIV 
AKVLRGAEPCCGPLKNRVLRPCPLP/VPLPPP 
HPQPS RGNP VGCLPTYKV VYICLLS WPLHSNS 
NVYFIV 


1 134 


2484 




9^10 




I JOO 


1 VI /\ U A O r JSJvKAJL o Al V Ab fc, Ivb tA l\±L R i M AAlv 

RADGAAPAGEGEGVTLQGNITLLKGVAVIVV 

AIMGSGIFVTPTGVL1CEAGSPGLALWWAAC 

GVFSIVGALCYAELGTTISKSGGDYAYMLDV 

YGSLPAFLKLWIELLIIRPSSQYIVALVFATYL 

LKPLFPTCPVPEEAAKLVACLCVXLLTAVNC 

YSVKAATRVQDAFAAAKLLALALIILLGFVQI 

GKGDVSNLDPNFSFEGTKLDVGNIVLALYSG 

LFaY.GGWNYLNFVTEEMINPYRNLPLAIIISLP 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Asparttc Acid, E=Glutamic Acid, 
F=Phenylalanine, G^Glycine, H=Histidine, 
l=Isoleucine, K^Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine 5 
T=Threonine, V^Valine, W=Tryptophan, 
Y=Tyrosine, X=Unkno\vn, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














IVTLVYVLTNLAYFTTLSTEQMLSSEAVAVDF 

GNYHLGVMSWIIPVFVGLSCFGSVNGSLFTSS 

RLFFVGSREGHLPSILSMIHPQLLTPVPSLVFT 

CVMTLFYAFSKDIFSVINFFSFFNWLCVALAI1 

GMIWLRHRKPELERPIKWLALPVFFILACLF 

LIAVSFWKTTPWSVASDFTIILSGLPVYFFGV 

WWKNKPICWAPPGHLSPRPSCVRSSCMVVPQ 


1135 


2485 


A 


9216 


40 


410 


RDRLPPAYFCRPVVCVVTALDVGVSPESQEM 
DLVAFEDVAVNFTQEEWSLLDPSQKNLYREV 
MQETLRNLASIGEKWKDQNIEDQYKNPRNNL 
RSLLGERVDENTEENHCGETSSQIPDDTLNK 


1136 


2486 


A 


9223 


3 


983 


RRRRRSRYRRCSRFPRPGPLAVSMPHAFKPG 

DLVFAKMKGYPHWPARIDDIADGAVKPPPN 

KYPIFFFGTHETAFLGPKDLFPYDKCKDKYGK 

PNKRKGFNEGLWEIQNNPHASYSAPPPVSSSD 

SEAPEANPADGSDADEDDEGXRGVMAVTAVT 

ATAASDRMESDSDSDKSSDNSGLKRKTPALK 

MSVSKRARKASSDLDQASVSPSEEENSESSSE 

SEKTSDQDFTPEKKAAVRAPRRGPLGGRKKK 

APSASDSDSKADSDGAKPEPVAMARSASSSSS 

SSSSSDSDVSVKJCPPRGRKPA£KPLPKPRGRK 

PKPERPPSSSSSD 


1137 


2487 


A 


9229 


21 


239 


LFPRLECRDPVTVNCTLNLPGSKNAPTTASQV 
GSTWNYRGGLPHPTNFFVKTGFRCSQAGLKL 
RGSREPPAWA 


1138 


2488 


A 


9231 


1664 


2 


TRSVGVNTCEVGWTEPECLGPCEPGTSVNL 

EGIVWHETEEGVLWNVTWRNKTYVGTLLD 

CTKHDWAPPRFCESPTSDLEMRGGRGRGKR 

ARSAAAAPGSEASFTESRGLQNKNRGGANGK 

GRRGSLNASGRRTPPNCAAEDIKASPSSTNKR 

KNKPPMELDLNSSSEDNKPGKRVRTNSRSTP 

TIPQGKPETTFLDQGCSSPVLIDCPHPNCNKK 

YKHINGLRYHQAHAHLDPENKJLEFEPDSEDK 

ISDCEEGLSNVALECSEPSTSVSAYDQLKAPA 

SPGAGNPPGTPKGKRELMSNGPGSIIGAKAGK 

NSGKKKGLNNELNNLPVISNMTAALDSCSAA 

DGSLAAEMPKLEAEGLIDKKNLGDKEKGKK 

ANNCKTDKN\PSKLKSARP1APAPAPTPPQL1A 

JQPTATFTTTTTGTIPGLPSLTTTVVQATPKSPPL 

KPIQPKPT1MGEPITVNPALVSLKDKIOCKEKR 

KLKDKEG KETG SPKMD AKLGKLED SKG ASK 

DLPGHFLKDHLNKNEGLANGLSESQESRMAS 

IKAEADKVYTFTDNAPSPSIGS 


1139 


2489 


A 


9234 


207 


443 


TRRGQPWRRRAAAAGILPGREAAACLPSC/AS 

VTAAVSGLLVGYELGIISGALLQIKTLLALSC 

HEQEMGVSSLV1GALL 


1140 


2490 


A 


9238 


248 


328 


MAQGNNYGQTSNGVADESPNMLVYRKV 


1141 


2491 


A 


9242 


2 


535 


FVEAAVKMLGSLVLRRKALAPRLLLRLLRSP 

TLRGHGGASGRNVTTGSLGEPQWLRVATGG 

RPGTSPALFSGRGAATGGRQGGRFDTKCLAA 

ATu/nPT prTPFFTT PGOD^WNGVPSRAGLGM\ 

WPWAAALVVHCYSKSPSNKDAALLEAARAQ 

\NMQEVSRNRCALLHSAAVQEYGYGN 


1142 


2492 


A 


9245 


157 


466 


HLCF WFFV GLFLPEQQIMLFATLLRMAQGCD 
FALGNDFLNITTKAQA/TKEKLDKLDFIK1KTC 
CTSMDAIEKTEPLTKWTKAFVSHVSYKRLLF 
GICKEYSRQ 


1143 


2493 


A 


9247 


264 


115 


GLPQQTSTIQPPGTPDGARDFTSTIQPPGAPDG 
ARDSTSIIRMGPEIPPP 
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SEQ ID 
iNvj. or 
nucl- 
eotide 
seq- 
uence 


SEQID 
sskj. or 
peptide 
seq- 
uence 


Met 
noa 


SEQ 
UJ in(J. 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Giutamic Acid, 
F=Phenyl alanine, G=Glycine, H=Histidine, 
l=isoIeucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine ? S=Serine, 
i — i nrconme, v— valine, w— i ryptopnan, 
Y=Tyrosine, X=Unknown, *=Stop codon. 
/=possib!e nucleotide deletion. \=possible 

IlLlwCULltlC lllbclllUlJ 


1144 


2494 


A 


9260 


1 


401 


KKVPGRLSEMSFSLNFTLPANTTSSPVT\DCGP 
SLGLAAGIPLLVATALLVALLFTLIHRRRSSIE 

TMG AQEAHI YVKT V AG SEEP VHDR YRPTI EM 
ERRR 


1145 


2495 


A 


9264 


175 


41 1 


METIWIYQFRLIEIGDSTVGKSCLLHRFTQGRF 
PGLRSPACDPTVGVDFFSRLLEIEPGKRIKLLL 
WDTAGQERFISIT 


1146 


2496 


A 


9277 


592 


S14 


MFTYLEGREGIKSQPKMEPHSVTVRLECSGMT 

SAHCSLNLPGTSDSPASASRA^AGTTGMRHHA 

WLIFAFLVETGF 


1147 


2497 


A 


9279 


1255 


2 


FRRGRRG EEEKEEEEEEEEG WVN GMEN SHPP 

HHHHQQPPPQPGPSGERRNHHWRSYKXMIDP 

AL KKGHHKL YRYDG QHF SL AM S SNRP VEI VE 

DPRVVGIWTKNKE\LELSVPKFKIDEFYVDQV 

PPKQVTFAKLNDN1RENFLRDMCKKYGEVEE 

VEILYNPKTKKHLGIAKVVFATVRGAKJDAVQ 

HUISTSVMGNIIHVELDTKGETRMRFYELVLV 

TGRYTPQTLPVGELDAVSPIVNETLQLSDALK 

RLKDGGLSAGCGSGSSSVTPNSGGTPFSQDTA 

YSSCRLDTPNSYG/QGTPLTPRLGTPFSQDSSY 

SSRQPTPSYLFSQDPAVTFKARRHESKFTDAY 

NRRHEHHYVHNSPAVTAVAGATAAFRGSSD 

LPFGTVGGTGGSSGPPFKAQPQDSATFAHTPP 

PAQATPAPGFR 


1148 


2498 


A 


9302 


1026 


6 


IASIQNADTMPGVGLI.VSHFSTLVSRQRCPNY 

ADPQNLTDVSIFLLLEVSGDPELQPVLAGLFL 

SMCLvlVLGNLLIILAISPDSHLHTPMYFFFSN 

LSLPDV\GFTSTTVPK\MIVD1\QSRSRV1SYAG 

CLTQKSLFAIFGGTEE\NMLLSVMAYDRFVAI 

CHPLYHSAIMNPCFCAFLVLLSFFFLSLLDSQL 

HSWIVLQFTIIKNVEISNFVCDPSQLLKFACSD 

SIINSIFrVTHKDPERQLVLAGLFLSMCLVTVL 

GNLIIILDVSPDSFILPTPMYFFLSNLSLPDIGFT 

STTVPKMIVDIQSHGRVIFYAGCLTQMSLFAIF 

OOIV^iiKriAi bCJJOL 


1149 


2499 


A 


9303 


1 


699 


MASQEKDIFIGWGTIHLFRKPQRSFFGKXLRE 
r KL, V AAUKoMuK YMJLr Cj VINL1CTGFLLM WC 
SSTNSIAL'AS YTYLTIFDLFSLMTCL1S Y W VTL 
RKPSPVYSFGFERLEVLAVFASTVLAQLGALF 

11 VFCAFDFI PHPFTUT^Dr T VPTrUAI r^TTKJX c 

ii^JNJDD/\^Kr JLliV^rC.lrll OKJL.L VU 1 r V AX,L,.rINLr 

TMLSIRNKPFAYVSEAASTSWLQEHVADLSR 
SLCGnPGLSSIFLPRMNPFVLIDLAGAFALCIT 
YMLIEI 


1150 


2500 


A 


9308 


797 


693 


DRSTSVTRAGVQWCSLGSLQPRTPGLLRSSCL 

°J P 


1151 


2501 


A 


9309 


205 


406 


VAIKELPVLWKWSKPTRMAKEPPQTQQRAG 
SKTAAPPCQWSRMASEGPNIPCPGARHSDKQ 
FLICTI 


1152 


2502 


A 


9314 


913 


504 


KPSPLITPPAVVLPPSAVLNLVNTFSSFPQVEV 
OGPLCGPRKGRLAVT1PFFGT S/T PKYMT>WPR 
PPPHR\EIFFVFLAETGFHRASQAGPDLPTS/S/I 
PPTSA/FPKCWEYRSEPQCLPGCLSFSGILLDI, 
GTNVSLRAA 


1153 


2503 


A 


9315 


392 


I 


HPHRPRPGFRSPARSSRPCPVLTSLLPPFPSPSP 
P ADDL VKAGRDRKDPQ VR/ERRLRPNPG RLG 
GPR\PRPARARS/CHQPRLTRVCPRSPPPEARA 
PAPA-^PARGRGAPKRNRPRTDTRAPRGSSAR 
PGNS 
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3 DOC ID: <WO 01571B8A2_L> 



WO 01/57188 




PCT/U SO 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO. of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=GIycine, H=Histidine, 
I=IsoJeucine, K=Lysine, L-Leucine, 
M=Methionine, N=Asparagine, P=Proline. 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine r V= Valine, W=Tryptophan, 
Y=Tyrusine, X= Unknown, *~Stop codon, 
/^possible nucleotide deletion, Y^possible 
nucleotide insertion 


1154 


2504 


A 


9321 


331 


433 


MPCI/QAQYGTPAPSPGPRDHSASDPLTPEFIK 
PT 


1155 


2505 


A 


9324 


180 


275 


MEEPQSDPSVEPPLSQETFSDLWK1XSENNVL 


1156 


2506 


A 


9326 


383 


619 


MISPSRTEGDPLPLPP/EGEGQEVRGFGGGPAK 
E AAQRHCRA S V SILRMRRr u vG o bKr AK V r L 
RGPDSHRLREPPPSPP 


1157 


2507 


A 


9327 


152 


292 


YERRGRSQGGGSHPAGAQPGGRAIGAGWQS 
KEPLWEGLQRSGSPLPG 


1158 


2508 


A 


9328 


1 


430 


QELKQGPNPLAPSPSAPSTSAGLGDCNHRVD 

LSKTFSVSSALAMLQERRCLYVVLTDSRCFL 

VCMCFLTF1QALMVSGYLSSVITTIERRYSLKS 

SESGLLVSCFDIGNLVVVVFVSYFRGRRRRP/ 

RVAAVGGLLDLEGGEMT 


1159 


2509 


A 


9334 


108 


383 


KGNQVNGNGNQLKRKHESMCPVSLTQNTVR 
LMEAGLPQKQAERADELFEAGLVIYVKLDER 
VLNALXYSSVGLQWFKESDLSHLRLLEISFR 


1160 


2510 


A 


9338 


2 


430 


FVGRPRGLSDRLEDLFLAGFRVGERLRTAAM 
KRYVRILLLGEGAEHVADPVPGGRGVPRGEA 
DHTDQELREEIHKANVERWFIDVSQEAT1EKI 
RTKWIPLV/RWGDHA/EGPVGIKSYLPSGRSM 
EAELPIMSQLTEIETCVEC 


1161 


2511 


A 


9341 


1 


390 


NSRVDDFVAPGLSEAGKLLGLEFPERQRLAA 

AVG/CSPMSGVISMSAPFFLGKIIDAIYTNPTV 

DYSDNLTRLCLGLSGVFLCGAAANAIRYYLM 

QTSRQRVVKRLRTSLFSSILGQEVAFSDKAGT 

GELI 


1162 


2512 


A 


9343 


84 


837 


QGRFRAFCWQRDFLQPPGMRLSALLALASKV 

TLPPHYRYGMSPPGSVADKRKNPPWIRRRPV 

VVEPISDEDWYLFCGDTVEILEGKDAGKQGK 

VVQVIRQRNWWVGGLNTHYRYIGKTMDYR 

GTMIP SEAPLLHRQ VKL VDPMDRKPTEIE WR 

FTEAGERVRVSTRS GRI1PKPEFPRADGI VPET 

WIDGPKDTSVEDALERTYVPCLKTLQEEVME 

AMGIKETRNNTRRSIGIEPGAEQLLPNFCPSLE 

G 


1163 


2513 


A 


9346 


967 


616 


DSLALSPRLECSGAISAHCNLTPPGFTPFSCLS 
LPSSWAYRCASPHPDNFFVFLVESGFHHVGQ 
AGLKLLISSDPPTSA/FPKCWDYRRDNSSAPAT 
FSSYQKNNPDLILNDTIMPNIK 


1164 


2514 


A 


9347 


3 


1099 


SSFPTCMRTVFHSNTSVSSLLHRPGHVTPQLTI 

HG G WRHHRDHTAIDE WDFNPSKFLI YTCLLL 

FSVLLPLRLDGIIQWSYWAVFAPIWLWKLLV 

VAGASVGAGVWARNPRYRTEGEACVEFKA 

MLIAVGIHLLLLMFEVLVCDRVERGTHFWLL 

VFMPLFFVSPVSVAACVWGFRHDRSLELEILC 

SVNILQFIFIAUCLDRIIHWPWLVVFVPEWILM 

SFLCL V VLY YIV WSLLFLRSLD V V AEQRRTH 

VT^^ISW1T1VWLLTFEVLLVHRLDGHNTFS 

YVSIFVPLWLSLLTLMATTFRRKGGNHWWF 

AIRRDF/CQDQLPQPTGKPPPPPLTDHHGbKA 

LPLQNKDRGSWPASRGSPRLL 


1165 


2515 


A 


9362 


547 


991 


DVSIGPPLLRRPCSGREQTRSLSFPSDPESSFSP 

VPEGVRLADGPGHCKGRVEVKHQNQWYTV 

CQTGWSLRAAKVVCRQLRCGRAVLTVQKRC 

TKHAYGRKPIWLSQMACSGPEPTLHDCPFRP 

LGEDTLFHVEYTSVHGRERLSAKD 


1166 


2516 


A 


9363 


201 


387 


PPILRWTPPSGKNFFFFFFFESEFY/SSPRVECS 
GAISAFLAHCNLCLPGSSDSPASAFQVAS 


1167 


2517 


A 


9368 


707 


1087 


AVLTPCLSPCSPSRIPRPVSRPYPGRRSLSHTPP 
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nJSDOCID: <WO 0157188A2_I_> 



WO 01/57188 




T/USO 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/4 96 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspanic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoieucine H K=Lysine. L=Leucine, 
M=Methionine 9 N-Asparagine, P=Proline, 
Q=Glutamine. R=Arginine, S=Serine } 
T=Threonine, V=Valine ; W=Tryp to P' ian > 
Y=Tyrosine, X^Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














PRPLILYAPAP\RPAGTAFEPHSHPPPPDLLRPT 
ATPA/TPCPSLPPPPRPLHPTQPSTALLPDPPPW 
PLPFPPPSS/RPPRPDCSTSYSPTFPPPT 


1 16S 


2518 


A 


9375 


511 


15 


MMLSEETSA VRPQKQTRFNG AKL V WMLKGS 
PITVTSAVIIVLMLLMM/IFSPWLATHDPNAED 
LTARLLPPS AAH WFGTDE VGRDLFSRVL VG S 
QQSILAGLVVVATTGMIGSPLECLFGELGGRA 
DAIFMRVMDIMRS/IPSLVLTMEKTAALGPSL I 
FNAMQASSEH 


1169 


2519 


A 


9377 


42 


410 


GNGR VAPRDPG A V AS AEPGLTTHDS G VNPN 
NSARRME AMAS G SN WLSGVNVVLVMAYWS 
LVFVLLFIFAKRQIMRFAMKSLRGPHGPVGH 
NAPKDLKEEIDILLSRVHNIKYEP\HLLADDDA 


1170 


2520 


A 


9378 


302 


1303 


GVSGFSASVLRQRRMEDELEPSLRPRTQIQGR 

ILLLTICAAGIGGTFQFGYNLSIINAPTLHIQEF 

TMETWQARTGEPLPDHLVLLMWSLIVSLYPL 

GGLFGALLAGPLAITLGRJCI<.SLL\\ r MNIFVVS 

AAILFGFSRKAGSFEMIMLGRLASWGVNAGV 

SMNIQP\MLPGGESAPKELRGAVAMSSAIFTA 

LGIVMGQVVGLSTTAATGLRGLXAGELEELEE 

ERAACQGCRARRPWELFQHRALRRQVTSLV 

VLGSAMELCGNDSVYAYASSVFRKAGVPEA 

KIQYAIIGTGSCELLTAVVSVSLEGALPPPAL 

WGGTPRSFALNQFTLQKKKK 


1171 


2521 


A 


9381 


2 


412 


RGPASAQEDERARTAPLERVRARGRMTTSSA 
LFPSLLPCSWSTSNKYLAEFR.^GICMSLKGTTE 
TPDKRKGLAY/IQQTDDSLIHFCWKDRTSGNV 
EDDLIIFPDDCEFKKLPQCPNGRVYVLKFKAG 
SKRLFFWMQEP 


1172 


2522 


A 


9384 


20 


355 


GWNGRSTEASPAAEAPHYPHKET\KAAJs4GTQ 
CTHGGKVRPDPHDMLTTVVHKIKLFVLCHSL 
LQLCAIM1SDYLKSSJYTVEKRLGLFRPTSGLL 
ASFNEVGNTALIVLESY 


1173 


2523 


A 


9393 


430 


87 


LCQCIVPGQQKETFSLNPSSATVRFYL*LSLQ 
QRKEDQ* IIL* YHLNKDCLHIFMSAITLYMKI* 
KIFVLFDFNIMFETPFYII*FIFLFSQNLKRIRQV 
IRPPI SFSKINNG P 


1374 


2524 


A 


9397 


77 


374 


ERLEIGREGGERGSGPASCLRVIDVSGMWDQ 
RLVICL ALLQLLRAF YGIKVKG VRVI IRDCGTF 
ESSSTLIRVS * FG VPCNALAHFG VTHF* YILDF 
LGML 


1175 


2525 


A 


9399 


66 


397 


HESSRADRDKMDTRGSTYTDADPVNKSGGT 
AiCMNKWSKGKVRDKLNNLVLFDTATYDKL 
CKEVPNYKLITLAVVSERLKIPGSLARAALHE 
LLSRGLI*LVIQHIAQVIY 


1176 


2526 


A 


9408 


2 


299 


LDI/THVLSLSISLTVTILGTTFGMVIPIXDVVY 
GERG Y A QNGDF* DAQLDD YSFSC Y SHAQVN 
GAPNSLTRA YDDP* VKISGLECQKVG ALVE V 
KCLNL 


1177 


2527 


A 


9416 


2 


402 


CNFLRSSR1RVHSTPAASTMPPKVDPNEIKVV 
YLRC I CiOr, VKA rbALAPKJGPLGLSSIICVG VD 
FV*ATGDWNVL11SVILT11ULLSHIFVVPPFFCF 
DHL1AFWDLQSLIFLHVIFSLFITLLLFCFFSIF 


1178 


2528 


A 


9419 


142 


426 


TPLFDLWPRVVLSWLETVLTSLRTRRAASGPP 
ACRIMPTTVDDVLEHGGEVHFLQKQMLYLL 
ALI* DTFAPIYVGIVFLGFTPDHRCRSPGVAEL 


1179 


2529 


A 


9420 


1450 


1655 


LSSAGTKMNLN* KNY WPG ASAHACNPSTLG 
GQSRCITRSGDRDHPG* HGETPS VLKJQKISRA 
WWRAP 
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5DOCID; <WO 0157188A2J„> 
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NO: of 1 
nucl- | 

seq- 
uence 


SEQ ID ] 
MO; of 
Deptide 
seq- 
uence 


Viet 
tiod 


SEQ i 

[DNO: 

in 

USSN 
09/496 
914 


Predicted 1 

Deginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E^Glutamic Acid, 
F=Phenyl alanine, G=Glycine ? H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucinc, 
M=Methionine, N=Asparagine, PHProiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine ? W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


1180 


2530 


A 


9422 


176 


375 


HRPQTTRPD WKPRT*PQGK* GRLSSEISPASPP 
SRFSRSTKPVPPKADPPARQKLTGVLHAPLLK 

L 


1181 


2531 


A 


9436 


2 


274 


"PIAASLRMYNLQPYTEENLICTAFATMVETVP 
IARTELDRLTGIPHG YCF VE* AD W ATADKCVH 
IYNGKPLPGATPLLSLQLHQLAHLGS 


1182 


2532 


A 


9442 


3 


240 


VDKCSSKSIVLSEYCPHCMCSLSTDPKPFGQL 
SM1LK* MG AGDEKI S AMGKAR VDHREL YLGL 
LYPTEDYKLTFRARH 


1183 


2533 


A 


9444 


384 


3 


LKDFQPWALHDWPLFCCCTFLLFLVLECFTR 
KGCSGWAPWLSLQCQHFGRPRWADHLRSGV 
RDQPGQYSKTTFLPK1QKLAGHSGAHL* S*LL 
ERMRWKNRLNPGGRSCSEPRWHHCTPGWAT 

ERG 


1184 


2534 


A 


9462 


391 


655 


LSGFKSLMPKIPLQYIYVRVRTTWSt (JLPLDU 
RKLMLS*YSK*LT*KYNILPEYSRMTLPPGMV 
IHTCNPSTLGGR AG WTV* AQEFET 


1185 


2535 


A 


9467 


215 


566 


" RCPMWQGQASRMDPAKAKDREASTCCSLA 
WWWGWECWVRALKLSSGPAGPLACWVAK 
KKSLSLSGPY YPSEKG AGLYVF* DRVSLCI IPG 
WSAWQFWLTAASNSCFSLLSSWDYRCA 


I 1 oo 


2536 


A 


9468 


275 


452 


HIPQLHTKTH Y VPTRMVNK I*QIDNSKPWQR 
GG*TGILTHCW*ESKLVQPLWKIVWHYQ 


1187 


2537 


A 


9469 


388 


3 


EVAPGPSQILPRRVTDGGDRPQFSLPGPRLPQ 
SSRGAEPCLSNCIHSPAPRKQRMGDSDQ*STP 
NPASPHPEAPQEPWDSASGSVGSFSLGRGAK 
ASS* VPGKGRGPRQGSELLAETILELFLALAN 
S 


1188 


2538 


A 


9471 


124 


397 


TMDKKNRHGNSLDMASEIHMTGPMCLIENTT 
GRLMANPEALKILSAITQPMVEEA1AGLYRAC 
*FYLTNNLAGMKKGLCLGSTEQAHTIGI 


1189 


2539 


A 


9480 


584 


769 


GHVQSQHFGRPRRADHLRSGDRDHPG*HDET 

PSLLKIQKISWAWWRAPVVPATWEAEAEEW 

K 


1190 


2540 


A 


9483 


463 


86 


VT VGLTLLLRG APRFTAG* PPSGGGPPL APLL 
PRQHCTLQTHRHLHPE AP VKV* KT* RLFPGLR 
GASSCRRRRCNPVLAARKAGSPRSHSTRENC 
RRSRCPDTAHRRRRRGRRRNPSCVRSPRWR 


1 191 


2541 


A 


9489 


1 


411 


" LADALCLSAAATGAVRPGARAQPSTRRRLSP 
SVRVCCRAAAASNLLYSSCLQRHSERASEEG 
ERGSLSAKCCSLVLRGGCSSSNSHSFRRIT*EI 
MAAFVLLSYEQRPLKRPRLGPPDVYPPDPKQ 
KEEELTAVNVK 


1192 


2542 


A 


9497 


389 


161 


VSFLSMSSGHC1RSTRGSKMVSWSV1AK1QE1* 
CEEDERKMAREFLAEFMSTYVMMMHMrVE 
KDTYSDHEEINTS 


1193 


2543 


A 


9509 


186 


1 


" IAKSQ*KRWQRSGAMETLKHGWWECKLVQF 
FGKTFVNVN* S*TYVYPCDKIILLLGLYPTEM 


1194 


" 2544 


A 


9512 


58 


433 


PLQRSKCLTLRCLRAKPWAWSQSPRACSSAL 
LKSSRSRASSLNVQCILQSNPQGHQRJ* KQKA 
SSKGQQFRR*KEHPFMLKTLNKLRIEGT*LKI 
RRAIYDNPTANI1VEGQKLEAFPLRTGTRQ 


1195 


2545 


A 


9515 


595 


1223 


' GHG APSFQTQ VPRTP* AS WP V VPAASES APAP 
AGGGASLPVAAGSCAAAPHTEPGAPQHLLDC 
PCPLCLARPPRRPLPDTCYGPGSGRSASLAEPP 
LPRCSCAPLRSASAPQVS*CV*AVNLLPIINL* 
PLHLLLHD*EKAWGFLFSSASHCFQGQICLLP 
APGSGPCGA^ARPSRGGRAGGSRARRPIPPGP 
GTRRTPSGCQNPAASGG 
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MSDOCID <WO 0157188A2_i_> 



WO 01/57188 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alajiine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenyl alanine, G=Glycine, H=Histidine, 
I=Iso leucine, K=Lysine, L=Leucine, 
M-Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tlireonine, V=VaIine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 


1 196 


2546 


A 


9518 


229 


468 


RSPTATPAPILAMGPGAPFARGGRPLPLLGAM 
AERVAPGWDLHTPYLPRTNSRRTPHL* *EPH A 
GYIGALFPMSGGWPGGQ 


1197 


2547 


A 


9521 


289 


448 


IAWLSGLFFPSNQANLCFLCYKL'i ADSRYRG 

I T A X JTTt T TT r T"/~> , V TTOX A A TT» T^T * A nHTI 1~* /*\ 1 7V~\T> A turn 

HAMRHLTGNTSMAIRFL* ADSRFQ VQRAR YE 
APNWKYKYGY*IPVDMLC 


1198 


254S 


A 


9524 


204 


I 


KNKKTTKCLSIVTLNISGPNQ*NKRHRVAEWI 
VKQEPN1CHL* ETHFPFRDTYRLKEREQKKRK 
SSYS 


1199 


2549 


A 


9546 


1785 


1943 


GGRFKESKLTNAGWQRNSFFIGPPKSIPWAA 
V*QRGDGKNPGVTHLNRPVGTX 


1200 


2550 


A 


9548 


186 


1 


VNAEKEF*KIQHYFMTKSQNKLH1EHTYLICPI 
KAIYDK\VTSDIMLNLQKL*AFFLRVIVRQI 


1201 


2551 


A 


9549 


591 


2 


SSVVEFPRGPRSSLPPLDSTFPCGSSPNWTGGC 
GSCPSGE*LVSPGSEQRKKYSNSNVIMHETSQ 
YHVQHL ATFIMDK SE AITS VDD AIRKL VQL S S 
KEKIWTQEMLLQ\rNDQSLRLLDIESQEELEDF 
PLPTVQR SQT VLNQLRYPS VLLLVCQDSEQSK 
PDVHFFHCDEVEAELVHEYMESALTDCRLGK 
AMRP 


1202 


2552 


A 


9552 


428 


1 


KYGNEGHWSRQCPNPGKPIRPCPLCRGPHWR 

LDCERPPQGPLPSLPELAKTSYSDLTGLATED 

*WGPGMDAPATTIASSKTRVTLMVAGRPVFF 

LI*YRATYSALPNFSGPTQSSQVSVVGIDGQV 

SKPRATPPLFCSLHTF 


1203 


2553 


A 


9568 


517 


73S 


RRKFERKQKQ*RYREGKQYRQRDKMKEWG 
EKEKRRREKGEREERKMRHRERKGESGQRD 
TMENWRVERLTEKJER 


1204 


2554 


A 


9573 


83 


415 


EDBCRLRLVDGDSRCAGRV*IYHDGFWGTICD 
DGWDLSDAHVVCQKLGCGVAFNATVSAHFG 
EGSGPIWLDDLNCTGTESHLWQCPSRGWGQ 
HDCRHKEDAGVICSEFTALR 


1205 


2555 


A 


9577 


64 


424 


ARGSCPTRPRTANGRMGETKDAPQMLVTFK 
DV A VTFFREEWRQL VLVHRTL YR* GMLETC 
GLLDTLRHN\^QPDVVFILLYHGTQLL1V1CRE 
VSHSPCAGDMRELFTREATLTPHPYNNGA 


1206 


2556 


A 


9584 


38 


476 


TLGAVLFSEVSKESSTSHSGGQLGRQNRHPKL 
SNFITPSSPRLKP*TASSQRNLGQILNMFLTAV 
NPQPLSTPSWQIETKYSTKVLTGNWMEERRK 
GLP YKHLITHHQEPPHRYLI ST YDDHYNRHG 
YNPGLPPLRTWN GQKLL WL 


1207 


2557 


A 


9586 


2 


412 


LRSSPAALLRALCITTVTGTALALRSRVATTN 
PDGCRNVLRPKYYRLCDKAESWGIALETVPT 
GVAVTSWAIMLTVLTLVCKGQDYNRRQKLP 
THILCLL*EKGIFGLTFAFIIGLDGSTGPTRFFL 
FGILFS1CFS 


1208 


2558 


A 


9597 


122 


3 


1KNYWPGMVAHACNPSPLGGRGRWIA*AQK 
FADAWADAW 


1209 


2559 


A 


9611 


148 


558 


KSLRNVWDLLNNTWKADRFFCHSSRTSTIRK 
GDPGPTFSKMSIWTSGRTSSSYRHDEKRNIYQ 
Kll<iJrlJLJLLL)KJKK I VTAJLkAGEDRAILLGLAM 
MVCSIMM*FLLGITLLRSYMQSVWTRESQCT 
LLNASITETFNC 


1210 


2560 


A 


9618 


384 


2 


SLHDMLMLAEQQQKQKWAVNTQNTAWSNA 

DSBCFGQR1LEKMEWSKGRGLGVQEQGGPDD1 

KVQVKNNDLGLQATINNEANWIAHQDDFNW 

LLAELNTCQRQETADS***WSPKNSHVGKDS 

GELSAK 


1211 


2561 


A 


9620 


316 


610 


QKHPGGGQL GRSPQEDSRFHNKAS SG VS R VR 
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SEQ ID 5 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID I 
MO: of 1 
peptide 
seq- 
uence 


Viet < 
lod 1 
i 

1 


5EQ I 
D NO: 1 
n i 
USSN 
09/496 
914 


Predicted 1 
beginning J 
lucleotide 
ocation 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end < 
mcleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cystcine, 
D=Aspartic Acid, E=Glutamic Acid, 
^Phenylalanine, G^Glycine, H=Histidine, 
f=Isoleucine, K=Lysine, L=Leucine, 
Vl=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, W=Tryptophan, 
y=Tyrosine 9 X=Unknown, *=Stop codon 3 
/=possible nucleotide deletion, \=possioie 
nucleotide insertion 














LGRAWWLTPVIPTLWEAKAGGSPE*D*AUKU 
GSRL*SQHFGRPRRVDHLRSAVQDQPGQHGE 
ypSLLKIQKIN V WuKKL oo i o >j ^ 


1212 


2562 


A 


9623 


297 


344 


QFPVDGDYQKIEKITQLFQAQNLSLCLAM1K 
TREL* KGvjGKGK-H.il *A V VrrJ^JSJSAJO i vjviv/vr 
AILNTSNCT* CF*ETKMLSDDPKAC VFE VSSA 
DL*NTSFGVIR 


1213 


2563 


A 


9624 


2 


356 


AELSLASTACGRN 1 bCjUbbl U i uivftxioar 
TSGT1LSAISCLWDLPTPVLRVGLSCQPSMSSQ 
IPRMYSTDVEAAVNSLEDLYLQAYYAYLCVG 
LYFHRDDMALEGVSRFJL*ELAE 

~r>r pnunm A T^l * \ 7T»"VTMV^'PXTr , I "MPT? firVCTp'^FPR 


1214 


2564 


A 


9634 


776 


912 


SLSRWVRAIvL *VPYNvliiNULjrNirivovj\j^oJ-«riv 
SHYCTPAWATEKDS 


1215 


2565 


A 


9636 


220 


426 


KPGNFAVSSEY*DITSGQLKTAVRG*JEMl St 
EENFGEKLHDIGFGNGFLDKT* KAQ ATKAK1 


1216 


2566 


A 


9637 


391 


76 


CFLEDGCTQAb*AfaisAA VM'&M/uir,iiW<J2> i 
RERRSIRFKMKNHSPDDTIKENVTISNIRTRICI 
NHLPETERNLLEHGLMYIRLNAAFCSLVAHS 
LFGF1LKAT 


1217 


2567 


A 


9655 


2008 


2432 


" LHCKMGALETQTHPCSQNMLRSLQKCJCCKV 
EEHHLQPVQVLQTLLHSATAGTGCRKFAKPr 
PAPPTPTPWRSRQSGKQSERAS*LKGRGRYGL 
GALGGRGGRALGGSRWPPPLPGETLFSGCKH 
RRRRRGSDAAPGEEAGT 


1218 


256S 


A 


9658 


3 


405 


HASARALLSPNLSPNNlvMAisoorvLorrilA 

VLMSAQEPWAIKEEHVIIQAEFYLNPDQSGEF 

MLDFEGEDTFHGDMAKKETVWRLE*LARLD 

NFEAQRALANIAADQAALEIMDMGSDYTLIP 

NVPPKVTVL 


1219 


2569 


A 


9662 


3 


284 


"PDWTEKRKMQDTGSILPLHWFGFGYAALVA 
YGGIIG YVKAGS VPSL AAGLLr GSLbui^Li A y y 
LSQDPRNVWVFLATSGTLAGIMGMRFYHSG 

KL 


1220 


2570 


A 


9669 


200 


699 


LLLTGY1QTLQNQQLSGNQQEMQAVDNL 1 bA 
PGNTSLCTRDYKITQVLFPLLYTVLFFVGLITN 

. . »n tru '/Mil fiT'CiMC'TTt?! V "M' P\/T CTl I 1 N/TFT TP 

GLAMRIFFQIRSKSNFIIrLKN I VibDi-iwiviii^ir 

PFKILSDAKLGTGPLRTFVCQVTSVIFYFTMY1 

SISFLGLITIDRYQKTTRPFKTSNPKNLLGAKIL 

K 


1221 


2571 


A 


9676 


164 


562 


KERDSSTFSAAM1 I MQGMb^AJVLru ALiro v r 
QLGNMAVIHSHLWKGLQEKFLKGEPKVLGV 
VQILTALMSLSMGITMMCMASNTYGSNPISV 
YIGYTIWGSVMrUoOoL.oif\AOAis.i i xwji_ v jvvj 
SLGMNITSS 


1222 


2572 


A 


9688 


43 


412 


" VAKMVKCCSAIGCASRCLPNSKLKGLTFHVF 
PTDENIKRKWVLAMKRLDVNAAQIWEPKKG 
DVLCSRHFIOCTDFDRSAPNIKXKPGVIPSIFDS 
PYHLOGKREKLHCRKNFTLKTVPATNYNH 


1223 


2573 


A 


9696 


308 


564 


RTSMGILYSEPICQAAYQNDFGQVWRWVKE 
DSSY ANVQDGFNGDTPLTC ACRRGH VRI VSFL 
LKKECLCQPQKPERENLLALCCE _ 


1224 


2574 


A 


9700 


3 


632 


" DAWASGGELGSLFDHHVQRAVCDTRAKYKb 
GRRPRAVKVYT1>JLESQYLLIQGVPAVGVMK 
ELVERFALYGAIEQYNALDEYPAEDFTEVYLI 
lO^MNLQSARTAKRKMDEQSFFGGLLHVCYA 
PEFETVEETRKKLQMRKAYVVKTTENKDHY 
Vl'KKKLVTEHKDTEDFRQDFHSEMSGFCKA 
ALNTSAGNSNP^XPYSCELPLCYFSSK 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F==Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine, K=Lysine_. L=Leucine, 
M=Methionine, N=Asparagine, P=Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V— V aline, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


1225 


2575 


A 


9710 


1 


163 


RSGCVLRMTEWETGAPAVAETPDIKXFGKWS 
TDD VHINDI SLQD YI AG VRLILL 


1226 


2576 


A 


9713 


82 


492 


QGLPSFLPAFGPSGSWLGPAPTLGSSCNTVDT 
ICHGYSEIRPLFYLSFCDLLLGLCWLTETLLYG 
ASVANKDIICYNLQAVGQIFY1SSFLYTVNYI 
WYLi 1 bLKMKJ-i I QbuQb I £>FLV ID Y ICR VCQ 
MAFVFSSL1 


1227 


2577 


A 


9720 


3 


416 


GKWKRTQVPLLGEECADMDLARKEFLRGNG 
LAAGKMNISTDI ,DTNY AELVLNVGR VTLGEN 
NRKKMKDCQLRKQQNENVSRAVCALLNSGG 
GVIKAEVENKGYSYKKDGIGLDLENSFSNML 
PFVPNFLDFMQNGNYF 


1228 


2578 


A 


9723 


278 


411 


EASSSNTVASNVADKTDPHSMNSRVFIGNLN 
TLVLQKSDVEAVF 


1229 


2579 


A 


9725 


121 


902 


LFAMSGFENLNTDFYQTSYSIDDQSQQSYDY 

GG SGGP YSKQ Y AG YD YSQQGRFVPPDMMQP 

QQPYTGQIYQPTQAYTPASPQPFYGNNFEDEP 

PLLEELGINFDHIWQKTLTVLHPLKVADGSIM 

NETDLAGPMVFCLAFGATLLLAGKIQFGYVY 

GTSAIGCLGMFCLLNLMSMTGVSFGCVASVL 

GYCLLPMILLSSFAVIFSLQGMVGIILTAGIIG 

WCSFSASKIF1SALAMEGQQLLVAYPCALLYG 

VFALISVF 


1230 


2580 


A 


9739 


11 


247 


TFVLNMNTPKEEFQDW^IVRIAAHLPDLIVYG 
HFSPERPFMDYFDGVLMFVDISGKCKRDVCL 
MWMSNRLAWEFTCRA 


1231 


2581 


A 


9744 


37 


1100 


TPLFDFWPGFVLSWLQPLSASLRARRAASGPP 

ACRIMPTTVDDVLEHGGEFHFFQKQMFFLLA 

LLSATFAPIYVGIVFLGFTPDHRCRSPGVAELS 

LRCGWSPAEELNYTVPGPGPAGEASPRQCRR 

YEYDWNQSTFDCVDPLASLDTNRSRLPLGPC 

RDGWVYETPGSSIVTEFNLVCANSWMLDLFQ 

S S VN VGFFIG SMSIG YIADRFGRKLCLLTT VLI 

NAAAG VLM AI SPTYT WMLIFRLIQGL VSKAG 

WLIGYILITEFVGRRYRRTVGIFYQVAYTVGL 

LVLAGVAYALPHWRWLQFTVALPNFFFLLY 

YWCIPESPRWLISQNKNAEAMRIIKHIAKKNG 

KSLPASL 


1232 


2582 


A 


9753 


164 


517 


PGPGMQGPPPITPTSWSLPPWRAYVAAAVLC 
YINLLNYMNWFIIAGVLLDIQEVFQISDNHAG 
LLQTVFVSCLLLSAPVFGYLGDRHSRKATMS 
FGILLWSGAGLSSSFISPRYSWLF 


1233 


25S3 


A 


9757 


25 


419 


LPAPWTERVRKSEGLVGTCLGDPMASPRTVT 
IVALSVALGLFFVFMGTIKLTPRLSKDAYSEM 
KJIAYKSYVRALPLLKKMGINSILLRKSIGALE 
VACGIVMTLVPGRPKDVANFFLLLLVLAVLF 
FHQLV 


1234 


2584 


A 


9765 


71 


456 


RLELDWGFSLHFLPVAYLCPLSSGFEMNVQP 
CSRCGYGVYPAEKISCIDQIWHKACFHCEVC 
KMMLS VNNF VSHQKKPYCHAHNPKNNTFTS 
VY HI rLINL-N VK1 ri^ilAlouJillJQbI^Obv^Ci l vc»V 
FHWD 


1235 


2585 


A 


9767 


52 


559 


IRSGAMSVDKAELCGSLLTWLQTFHVPSPCA 

SPQDLS SGL A V A Y VLNQIDPS WFNE A WLQGI 

SEDPGPNWKLKVTSGLLIRGQTGEEMTRDGP 

ARHMSWVMGRKJIDRCLVINHLFIHSSMEYSP 

C.^RPGHSARNNTDKNLPHTAIILVTSNTYTTI 

KFNFQAGRSGSCL 


1236 


2586 


A 


9770 


352 


608 


FRGEALTVRFLTKRFIGEYASNFESIYKKHLC 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 1 
ID NO: ! 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=GIycine, H=Histidine, 
l=Isoleucine, K=Lysine, L=Leucine. 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrinc, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown> *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














LERKQLNLEI YDPCSQTQKAKFSLTSELH W A 
DGF VI VYDISDRSSF AF AKAL1 


1237 


2587 


A 


9793 


266 


515 


NILAIIYFPFPRLFLLRDSQSNPKAFALTLCHH 
QKIKNFQILPVSIDALTPPLVVCFLVSFLTHFS 
RYKPTRPVCITQFQGCS 


1238 


2588 


A 


9802 


537 


967 


ELGAGRSDREAMEAA VKEEI S VEDEA VDKNI 

FRDCNKIAFYRRQKQWLSKKSTYRALLDSVT 

TDEDSTRFQIINEASKVPLLAEIYGIEGNIFRLK 

INEETPLKPRFEVPDYLTSKPSTVRLISCSGDT 

GSL1LADGK-GDLK.C 


1239 


2589 


A 


9805 


105 


540 


VPGDPAMVRAGAVGAHLPASGLDIFGDLKK 
MNKRQLYYQVLNFAM1VSSALM1WKGLIVLT 
GSESPIVWLSGSMEPAFHRGDLLFLTNFRED 
PIRAGEIVVFKVEGRDIPIVHRVIKVHEKDNG 
DIKFLTKGDNNEGDDRG SYK 


1240 


2590 


A 


9819 


3 


305 


TDGRDPLPCAARRRGGGGECCGAGWVAEWS 
PQPLDPAMLLWMQGFVLEAVACQDNDDYLR 
YGILFEDLDCNGDGVVDI1ELQEGLRNWSSAF 
DPNSEEHG 


1241 


2591 


A 


9834 


841 


1209 


S P ARGKSNRTD VM IT APKNKKMTENL AAPE A 
LDSSTHSSSTATQSRAKMNTPAPTPSTVPAIPR 
GGSGGPPPCAPHDRVSSVLQCDTQAMDHKTE 
SSHSVVEFLFKRTKTPSPFHPAVRENRN 


1242 


2592 


A 


9843 


3 


589 


TISCGPATEPPASLLSSASSDDFCKEKTEDRYS 

LG S SLDSGMRTPLCRI CFQGPEQGELLSPCRC 

DGSVKCTHQPCLIKWISERGCWSCELCYYKY 

HV1 AISTKNPLQ WQ AI S LT VIEKVQ V AAAILGS 

LFLIASISWLIWSTFSPSARWQRQDLLFQICYG 

MYGFMDVMIVAVDSEDMVQAAKEVGKRWS 

DIPP 


1243 


2593 


A 


9846 


198 


411 


WRISHHAGKMPVMKGLLAPQNTFLDT1ATRF 
DGTHSNFILANAQVAKGFPIVYCSDGFCELAG 
FARTEVMQ 


1244 


2594 


A 


9848 


116 


650 


PICGFLYLCSAMASESSPLLAYRLLGEEGVAL 

PANGAGGPGGASARKLSTFLGVVVPTVLSMF 

SIVVFLRIGFVVGHAGLLQALAMLLVAYFILA 

LTVLSVCA1ATNGAVQGGGAYC1LQHRWTG 

VWPVLPAREVMISRTLGPEVGGSIGLMFYLA 

NVCGCAVSLLGLVESVLDVFGA 


1245 


2595 


A 


9849 


573 


1620 


KSKCRFPEGLSEGFGPMRJCEALSSGSVQEAJE 

AMLDEPQEQ AEGSLT VY VI SEH S SLLPQDN4M 

SYIGPKRTAVVRG1MHREAFN1IGRR1VQVAQ 

AMSLTEDVLAAALADHLPEDKWSAEKRRPL 

KSSLGYEITFSLLNPDPKSHDVYWDIEGAVRR 

YVQPFLNALGAAGNFSVDSQILYYAMLGVNP 

RFDSASSSYYLDMHSLPHVINPVESRLGSSAA 

SLYPVLNFLLYVPELAHSPLYIQDKDGAPVAT 

NAFHSPRWGG1MVYNVDSKTYNASVLPVRV 

EVDMVRVMEVFLAQLRLLFGIAQPQLPPKCL 

LSGPTSEGLMTWELDRLLWARSVENLATATT 

1 LI Ol_>-rV 


1246 


2596 


A 


9850 


114 


464 


PPQLGAQRVREPRHPDVRAPLRVTSPGLRSRS 
ARSLGRRPR1AMVTVGNYCEAEGPVGPAWM 
QDGLSPCFFFTLVPSTRMALGTLALVLALPCK 
RRERPAGADSLSWGAGPRISSYV 


1247 


2597 


A 


9851 


2 


327 


FVRNKKMTRSCSAVGCSTRDTVLSRERGLSF 
HQFPTDTIQR SK WIRAVNR VDPRSKKJWIPGP 
GA1LCSKHFQESDFESYG T RRKLKKGAVPSVS 
LYKVFKYSSRCTS 
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SEQ ID 
NU: oi 
nucl- 
eotide 
seq- 
uence 


SEQ ro 
inU: ol 
peptide 
seq- 
uence 


Met 
i_ _ j 
nod 


SEQ 

ir\ via . 
ID NU. 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cystcine, 
D=Aspartic Acid, E— Glutamic Acid, 
F=PhenyJ alanine, G=Glycine, H=Histidine, 
l=lso!eucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine 3 R-Arginine, S=Serine, 
T~Threonine, V=Valine, W— Tryptophan, 
Y—Tyrosine, X—Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possib!e 
nucleotide insertion 


124S 


259S 


A 


9853 


58 


444 


RVDDFVYSKGGKDAGGADVSLACRRQSIPEE 

FRG1TVVELIKKEGSTLGLTISGGTDKDGKPR 

VSNLRPGGLAARSDLLNIGDYIRSVNGn-ILTR 

LRHDEIITLLKNVGERVVLEVEYELPPPGGCP 

WT 


1249 


2599 


A 


9856 


2 


1265 


LPPPRPSRHRRGRAGTRASAAAAAGPT V S AV 

RAPVRGQDSGAGTPQGRLAGRGAHLSRVGA 

S G SG V AAGP AARIIAPRRRC AD AG E A VG ASC 

GRCAVALLSGVCTLVSTHVCVGSGCPGAAGT 

PMGAGDAGASAESAVTTAPQEPPARPLQAGS 

G A GP APGRAMRSTTLLALL AL VLL YL VS GAL 

VFRALEQPHEQQAQREL'GEVREKFLRAHPCV 

SDQELGLLIKEVADALGGGADPETNSTSNSSH 

SAWDLGSAFFFSGTIITTIGGGGDWHVGGGK 

ELPHGGRCRETEGSQVAPRLPASPLCPGYGN 

VALRTDAGRLFCIFYALVGIPLFGILLAGVGD 

RLGSSLRHGIGHIEAIFLKWHVPPELVRVLSA 

MLFLLIGCLLFVLTPTFVFCYMEDWSK.LEAIY 

FVIVTLTTVGFGDYVA 


1250 


2600 


A 


9873 


2 


652 


FVVPSPCGGIPGRAPNGASRPTMGNSASRNDF 

EWVYTDQPHTQRRKEILAKYPAIKALMRPDP 

RLKWAVI,VLVLVQMLACWLVRGLAWRWLL 

FWAYAFGGCVNHSLTLA1HD1SHNAAFGTGR 

AARNRWLAVFANLPEGVPYAASFKKYHVDH 

HRYLGGDGLDVDVPTRLEGWFFCTPARIvLL 

WLVLQPFFYSLRPLCVHPKAVTRMEVLNTLV 

QLA 


1251 


2601 


A 


9875 


150 


1209 


PVIMPLHFSPGDIVRPSCCVSSSPKLRRNAHSR 

LES YRPDTDLSREDTG CNLQH1 SDRENIDDLN 

MEFNPSDHPRASTIFLSKSQTDVREKRICSLFIN 

HHPPGQIARKYSSCSTIFLDDSTVSQPNLKYTI 

KCVALA1YYHIKNRDPDGRMLLDIFDENLHPL 

SKSEVPPDYDKHNPEQKQIYRFVRTLFSAAQL 

TAECAIVTLVYLERLLTYAEIDICPANWICRIV 

LGAILLASKVWDDQAVWNVDYCQILKD1TVE 

DMNELERQFLELLQFNINVPSSVYAKYYFDL 

OCT A "C A XTXTT OTTTJT T7 Til enm A T 11/ t r A Ipni /~iTT\ 

KoL Ab AN N Li>r PLEr L SRERAliKLE Al SRLCED 
KYKDLRRSARKRSASADNLTLPRWSPAIIS 


1252 


2602 


A 


9879 


6 


376 


KRPDSRPP AQ YR A GPTRPRTRGCELL Y WKAT 
KAVGIKMGSLSTANVEFCLDVFKELNSNNIG 
DNIFFS SL SLL Y AL SM VLLG ARG ETEEQLEK V 
WNooLVCooi^KoLbCbKoUoArU^lLbLYQ 


1253 


2603 


A 


9880 


180 


388 


KEQAELLYGLYCQCDLTLSSHPSSVPAMSSC 

NFTHATFVLIGIPGLEKAHFWVGFPLLSMYVA 

AMFGNC 


1 O^A 




A 

A 








Vibr Qll 1 u I lMDbS 1 AHSPVFL Vr PPEH ASEYE 

STELSATTFSTQSPLQICLFARKMKILGTIQILF 

GIMTFSFGVIFLFTLLKPYPRFPFIFLSGYPFWG 

SVLFINSGAFLIAVKRKTTETLI1LSRIMNFLSA 

LuAJAULJLLTFErHPRSKLHL 


1255 


2605 




9896 


72 


JOO 


GATCVGLPNVGMCPQLSGALTFMYLQQGNQ 
EATVAPDTMAQPYASAQFAPPQNGIPGEYTA 
PHPHP APE YTG QTT 


1256 


2606 


A 


9902 


95 


399 


SGGPAGLLHRPVT.PKMGLSGLLPILVPFILLG 
DIQEPGPIAEGILGKPCPKIKVECEVEEIDQCTK 
PRDCPENMKCCPFSRGKKCLDFRKVSLTLYI-1 
KEELE 


1257 


2607 


A 


9905 


374 


459 


EHLKSTPNRLGVVAHTCNPSTLGGRGGW 
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c 

1 

I 

t 


>EQ ID J 
^O: of T 
lucl- \ 
jotide : 
>eq- i 
lence 


SEQ ID I 
nIO: of 1 
jeptide 
;eq- 
lence 


Viet S 
lod 1 
i 
1 


3EQ 1 
DNO: \ 
n i 
JSSN 
39/496 ( 
914 


Predicted 1 
beginning i 
nioleotide 
ocation 
correspond i 
mg to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end i 
lucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence. 


\mino acid sequence (A=Alanme C=Cysteine, 
3-Aspartic Acid, E=Glutamic Acid, 
r=Phenylalanine, G=Glycine, H==Histidine, 
=Isoleucine, fOLysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=GIutamine, R^Arginine, S=Serine, 
T-Threonine, V=V aline, W=Tryptophan, 
Y=Tyrosine, X^Unknown, *-Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 




1258 


2608 


A 


9911 


364 


1974 


AGPGVPAVGGRWASGPGLGGRTLCSGPFDH 

QRRGPSCGASGDPQCVGSPHPQRARPLLARP 

GARLLPGHLPSPRPPRLPTGQPPAAAFRGPVR 

PQGGGHIHPLPTPGGRPCFAVSEGSGSALLLS 

YLGECGSSSYVTGAACISPVLRCREWFEAGLP 

WPYERGFLLHQK1ALSRYATALEDTVDTSRL 

FRSRSLREFEEALFCHTKSFPISWDAYWDRND 

PLRDVDEAAVPVLCICSADDPVCGPPDHTLTT 

ELFHSNPYFFLLLSRHGGHCGFLRQEPLPAWS 

HEVILESFRALTEFFRTEEIUkGLSRHRAbr Lb 

G RRRGG ALQRRE V S S S SNLEEIFN WKRS YTRL 

MAAAAGAAAAPGSREPQDRPECGAGHPGPR 

YYRHPERWLLRPEAFLGPLR 1 KArbAbUbyK 

ERPAARSGPEMRVRYPVVAAVLAPYLALSQD 

PMVKSSASGQGASGSYNHVREEMLIKAGGA 

MSRRVVRQSKFRHVFGQAAKADQAYEDIRV 

SKVTWDSSFCAVNPKFLAIIVEAGGGGAFIVL 

PI.AK 




2609 


A 


9919 


693 


935 


G CFKFI G ESTCC WIFPSS VTTQC V V AKAPRAA 
TLSKAERLRSQPGPEQGGSSYRPRTPTAAAIL 
PPRPGRSHRKRKLVSTK 


1260 


2610 


A 


9921 


455 


1082 


QRSCLCSAIEKDGGDVKALYRRSQALEKLGR 

LDQAVLDLQRCVSLEPKNKVFQEALRN1GGQ 

IQEKVRYMSSTDAKVEQMFQDLLDPEEKGTE 

KKQKASQNLVVLAREDAGAEKIFRSNGVgLL 

QRLLDMGETDLMLAALRTLVGICSEHQSRTV 

ATLSILGTRRVVSILGVESQAVSLAACHLLQV 

MFDALKEGVKKGFRGKEGAIIV 


1261 


2611 


A 


9928 


1 


438 


GFRGAEAPGAAQAPKKKKPRPTEGGPGAUSG 
RGKDP Y RGPTLLHQPKPPKDEFLS SLES YEI AF 
PTRVDHNGALLAFSPPPPQKvJKiWj 1 u A 1 
RLF YKE ASPSTHFLLNLTRS SRLL AGH V S VE Y 
WTREGLAWQRADRPHCLYA 


1262 


2612 


A 


9931 


16S 


435 


AAEMGRAGAAAVlPGLALLWAvUJAjUri'.t'A 
PPRLPFCLQELQGRHALHTFSLERTCSYQDFL 
WADEGRLLHVGAQDLATWHTLSPLGLW 


1263 


2613 


A 


9938 


247 


488 


RMSATSVDQRPKGQGNKVSVQNGSIHQKUG 
CNDDDFEPYLRSPDNQSNSYPPMSDPYMPGY 
YAPSIGFPYSLGEAAWSQL 


1264 


2614 


A 


9941 


61 


277 


ESIGLTALGPRRIO* W tHKWMJn l i^ivivttvvj wvj 
WLALLLGALLGTAWARRSQDLHCGACKAVR 
RRVRQFNIYDY 


1265 


2615 


A 


9956 


2 


522 


" " FVASEVSKMPWASWPHPPGPFLLLTLLLGLT 
E V AGEEELQM1QPEKLLLVTVGKTATLHCTV 
TSLLPVGPVLWFRGVGPGRELIYNQKEGHFP 
RVTTVSDLTKRNNMDFSIRISS1TPADVGTYY 
CVKFRKGSPDHVEFKSGAGTELSVRGEYSVG 
FLSOVWWWLSSHPFMN 


1266 


2616 


A 


10002 


243 


387 


PKNNACHLLF 1 A vugPRCKHGbClOPNKCKU 
HPGYAGKTCNQGRKTV 


1267 


2617 


A 


10004 


36 


ncn 
i\j i 


LPAPASTWSVARETMASSSVPPATVSAATAG 

PGPGFGFASKTKKKHFVQQKVKVFRAADPLV 

GVFLWGVAHSTNELSQVPPPVMLLPDDFKAS 

SKIK VNNHLFHRENLP SHFKFKEY CPQ V FRKL 

RDRFG1DDQDYLVSLTRNPPSESEGSDGRFLIS 

YDRTLVIKEVSSEDIADMHSNLSNYHQVRPLS 

SPILSLSSLLTYSSAIVSNRCQLGRKLIGRENP 


1268 


2618 


A 


10005 


2 


209 


GEGYELFVPSNGVPAVCHMVGRRPHRAVLSP 
SQDELEHSLGESAAQGAAGVVLWVSWENTR 
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beginning 
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Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 


Amino acid sequence (A=Alanine C^Cysteine, 
D=Aspartic Acid, E=G!utamic Acid, 
, F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Pro)ine s 
Q=Glutaminc, R^Argininc, S=Scrine, 
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Y=Tyrosine, X = Unknown, *~Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














TKVSLGLA 


1269 


2619 


A 


10010 


245 


688 


FGMLKNKG H S SKKDN L A VN A V ALQDHIL HD 
LQLRNLSVADHSKTQVQKKENKS[.KRDTKAJ 
IDTGLKKTTQCPKLEDSEKEYVLDPKPPPL'1'L 
A^lU^uLluPrPPrLbbDhWEKVKQRSLLQGDS 
VQPCPICKEEFELRPQVFSIRG 


1270 




A 


i nm i 

1 UUl I 


z 




RVDDFVl^LPPGLMSRSRASIHRGSIPAMSYA 
PFRDVRGPSTHRTQYVHSPYDRPGWNPRFCII 
SGNQLLMLDEDEIHPLLIRDRRSESSRNKLLR 
RTVSVPVEGRPHGEHEYHLGRSRRKSVPGGK 
Kl Y oMEOy\PAAPFi^SQGFLSRJ<iKSSXKRTKS 
QPKLDRTSSFRQILPRFRSADHDRYRGWSMW 
DEIDV 


I27i 


2621 


A 


10013 


209 


363 


LPAPPNLSPRLSFGFQFPGGNDNYLTITGPSHP 
FLSGAEVSQSCRRRGGRA 


1272 


2622 


A 


10014 


7 


388 


SAVTISWKWRSVMGIQTSPALLASLGAGLVT 
LLGLAVGSYLVRRSRRPQVTLLDPNEKDLLR 
LIDKTLSARSPCKHIYLSTRIDGSLSIRPYTPVT 
SDEDQGYVDrDIKVYLKGVHPTFPEGGKMSH 


1273 


2623 


A 


10016 


1 


1339 


MAAR'l LGRGVGRLLGSLRGLSGQPARPPCGV 

SAPRRAASGPSGSAPAVAAAAAQPGSYPALS 

AQAAREPAAFWGPLARDTLVWDTPYMTVW 

DCDFSTGKIGWFLGGQLNVSVNCLDQHVRKS 

PESVALIWERDEPGTEVR1TYRELLETTCRLA 

NTLKRHG VHRGDR V AI YMP VS PL A V AAMLA 

CARIGAVHTV1FAGFSAESLAGRINDAKCKVV 

ITFNQGLRGGRVVELKKIVDEAVKHCPTVQH 

VLVAHRTDNKVF1MGDLDVPLEQEMAKEDP 

VCAPESMGSEDMLFMLYTSGSTGMPKGIVHT 

QAGYLLYAALTHKLVFDHQPGDIFGCVADIG 

W1TGHSYVVYGPLCNGATSVLFESTPVYPNA 

GRYWETVERLKINQFYGAPTAVRLLLKYGD 

AWVKJCYDRSSLRTLGSVGEPINCEAWEWLH 

RV VG DSRCTL VDT W WQT 


1274 


2624 


A 


10017 


1 


3750 


FRPQGTPRSPASHVLTMSAPDEGRRDPPKPKG 

KTLGSFFGSLPGFSSARNLVANAHSSARARPA 

ADPTGAPAAEAAQPQAQVAAHPEQTAPWTE 

KXLQPSEKMVSGAKDLVCSKJvlSRAKDAVSS 

GVASVVDVAKGVVQGGLDTTRSALTGTKEV 

VSSGVTGAMDMAICGAVQGGLDTSKAVLTG 

TICDTVSTGLTGAVNVAKGTVQAGVDTTKTV 

LTGTKDTVTTGVMGAVNLAKGTVQTGVETS 

KAVLTGTKDAVSTGLTGAVNVARGSIQTGV 

DTSKTVLTGTKDTVCSGVTGAA^NVAKGTIQT 

GVDTSKTVLTGTKDTVCSGVTGAMNVAKGT 

IQTGVDTSKT\rLTGTKDTVCSGVTGAMNVA 

KGTlQTGVDTTKTVLTGTKhTTVCSGVTGAVN 

LAKEA1QGGLDTTKSMVMGTKDTMSTGLTG 

AANVAKG.AMQTGLNTTQNIATGTICDTVCSG 

VTGAMNLARGTIQTGVDTTKIVLTGTKDTVC 

AVSTGLTGAVNVAKGTVQTGVDTTKTVLTG 
TKDTVCSGVTSAVNVAKGAVQGGLDTTKSV 
V1GTKDTMSTGLTGAANVAKGAVQTGVDTA 
KTVLTGTKJDTVTTGLVG AVN VAKGTVQTGM 
DTTKTVLTGTKDTIYSGVTSAVNVAKGAVQT 
GLKTTQN1ATGTKNTFGSGVTSAVNVAKGAA 
QTGVDTAKTVLTGTKDTVTTGLMGAVNVAK 
GTVQTSVDTTKTVLTGTKDWCSGVTGAAN 
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nucleotide 
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of peptide 
sequence 
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/=possible nucleotide deletion, V=possible 
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VAKGAIQGGLDTTKSVLTGTKDAVSTGLTGA 

VKLAKGTVQTGMDTTKTVLTGTKDAVCSGV 

TGAANVAKGAVQMGVDTAKTVLTGTKDTV 

CSGVTGAANVAKGAVQTGLKTTQNIATGTK 

NTLGSGVTGAAKVAKGAVQGGLDTTKSVLT 

GTKDAVSTGLTGAVNLAKGTVQTGVDTSKT 

VLTGTKDTVCSGVTGAVNVAKGTVQTGVDT 

AK.TVLSGAKDAVTTGVTGAVNVAKGTVQTG 

VDASKA V LMu 1 KJJ 1 V r bu v i UAivi;M\±/-vrvAj/\ 

VQGGLDTTKTVLTGTKD AVS AGLMG SGNVA 

TGATHTGLSTFQNWLPSTPATSWGGLTSSRT 

TDNGGEQTALSPQEAPFSGISTPPDVLSVGPEP 

a it re a a ATTf nT ATnV ATFTOHA APGRFDTG 
AWEA-AA1 IKuLAlUVAir iv^L3/Aj t vrvji.vc,j_/.i.vj 

LLA1THGPEEAPRLAMLQNELEGLGDIFHPM 

NAEEQAQLAASQPGPKVLSAEQGSYFVRLGD 

LGP SFRQRAFEH A VSHLQHG QFQ ARDTL AQL 

QDCFRL 


1275 


2625 


A 


10025 


124 


415 


TILAR.KKEKTCPCKKEIGRNSRSGMYSRKAM 
YKRivY S AA N I K V bKJs.Krs.fciv v L/\r v l rvr v vjvj 
DKNGGTRVVKLPTMPRYYPTEDVPRKLLSHG 
KKPFS 


1276 


2626 


A 


10030 


3 


507 


GGSLRFSPPRVPSCSRVFCPVPPGGCGLPSPMS 

ASRPQSPTTPWCLPRRYMKHKRDDGPEKQED 

EAVDVTPVMTCVFVVMCCSMLVLLYYFYDL 

LVYWIGIFCLASATGLYSCLAPCVRRiLPFGK 

CRIPNNSLP YFHKRPQ ARMLLL ALFCV AV S V 

VWGVFRNEDQ 


1277 


2627 


A 


10035 


51 


869 


YSRFTVPLPA I MAojDVAKnLLr^onivi/viJvi 

TCMSSQGSDDEQIKRENIRSLTMSGHVGFESL 

PDQLVNRSIQQGFCFNILCVGETGIGKSTLIDT 

LFNTNFEDYESSHFCPNVKLKAQTYELQESN 

VQLKLTIVNTVGFGDQINKEERQLGRSQSTEN 

T*r\is ci?/^Tjr>\/rrPVT< r r , Tc:FU/k'n A T GK WAG IE 

SSGQSAQQPYLPINSPPHRLADVADVHLFSSV 

LSGAFGCYHLDVTVNEFKKQQNRDEQEGYS 

vr , nncnnc\x/l<'T-iriAnPI RGOFM 
Js.ijl-'VfcW W ts^rtyjr\Ui JL»r\Ajvj,E»i.vi 


1278 


2628 


A 


10036 


3 


457 


RAFDVRRKKSLRPCCPRDFHAGCLTVSGPST 
VMGAVGESLSVQCRYEEKYKTFNKYWCRQP 
CLPIWHEMVETGGSEGVVRSDQVnTDHPGDL 
tctvti pxti TAnnAOKYRCGTATILOEDGLSG 
FLPDPFFQVQVLVSSASSTENSVKTP 


1279 


j 2629 


A 


10039 


214 


435 


-\Tr»QT VP\4Q<sWT? , srAR APSSESAWRRSAATRR 
SRKCLRTKRKRWSSGKGTQMQSTLSETPRRA 
QMPCMWWYPFWG 


1280 


2630 


A 


10043 


2 


344 


" RATWHNAGKEREAVQLMAGAEKRVKASHS 
FLRGLFGGNTRIEEACEMYTRAANMFKJVLAK 
NWSAAGNAFCQAAKLHMQLQSKFIDSATSFV 
DAGNAYKKADPQGKTARHVACYLCV 


1281 


2631 


A 


10080 


620 


818 


" VIYKLDSSLFSYFIYFFIFETESHFLPLMKWTG 
PIMAHCSLKILASRNSADSAFLSAGDTSLSHST 


1282 


2632 


A 




3 


1640 


SAS1URGDKRASGEVGIAPSSRH1LIGEPSAKY 

NGTAI1SLVRGPGILGEVTVFWRIFPPSVGEFA 

ETSGKLTMRDEQSAV1VVIQALNDD1PEEKSF 

YEFQLTA VS EGG VLSE S SSTANITV V A SD SPY 

GRFAFSHEQLRVSEAQRVNITIIRSSGDFGHVR 

LWYKTMSGTAEAGLDFVPAAGELLFEAGEM 

RKSLHVEILDDDYPEGPEEFSLTITKVELQGR 

GYDFTIQENGLQIDQPPEIGNISIVRIIIMKNDN 

AEGIIEFDPKYTAFEVEEDVGLIMIPV /RLHGT 
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nucleotide 
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Amino acid sequence (A=Alanine C— Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
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T=Threonine. V=Valine, W=Tryptophan. 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/possible nucleotide deletion, \=possible 
nucleotide insertion 














YGYVTADFISQSSSASPGGVDYILHGSTVTFQ 

HOQNLSFIN1SIIDDNESEFEEPIEILLTGATGG 

AVLGRHLVSRIIIAKSDSPFGVIRFLNQSKISIA 

NPNSTMILSLVXJERTGGLLGEIQWWETVGPN 

SQEALLPQNRDIADPVSGLFYFGEGEGGVRTII 

LTIYPHEEIEVEETFIIKLHLVKGEAKLDSRAK 

DVTLTI QEFGDPNG WQFAPETLSKKTYSEPL 

ALEGPLLITFFVRRVKGTFGEIM 


1283 


2633 


A 


10088 


316 


516 


MGSKTI.PAPVPIHPSLQLTNYSFLQAVNGLPT 
VPSDHLPNLYGFSALHAVHLHQWTLGYPAM 
HLXRS 


1284 


2634 


A 


10091 


2 


569 


FVSPSRAMASALIYVSKFKSFVILVVTPLLLLP 

LVILMPAKFVRCAYVIILMAIYWCTEVIPLAV 

TSLMPVLLFPLFQILDSRQVCVQYMKDTNML 

FLGGLIVAVAVERWNLHICRIALRTLLWVGA 

KPARLMLGFMGVTALLSMWISNTATTAMMV 

PIVEAILQQMEATSAATEAGLELVDKGKAKE 

LP 


1285 


2635 


A 


10092 


290 


728 


KQSTRPDVMTLYPLHWQEEMSGESVVSSAVP 
AAATRTTSFKGTSPSSKYVKLNVGGAEYYTT 
MQTLTKQDTMLKAMFSGRMEVLTDSEGW1L 
IDRCGICHFGTILNYLRDGAVPLPESRRE1EELL 
AEAKYYLVQGLVEECQAALQV 


1286 


2636 


A 


10100 


1 


574 


RPRGRGAWAGPGGDYSGVRRQQRRRTRISGS 

QRGSDAAGTMGCCTGRCSLICLCALQLVSAL 

ERQIFDFLGFQWAPILGNFLHIIVVILGLFGTIQ 

YRPRYIMV YT V WTAL W VTWN VEIICF YLE VG 

GLSKDTDLMTFN1SVHRSWWREHGPGCVRR 

VLPPS AHGMMDD YT Y VS VTG CI VDFQ YL E VI 

HSA 


12S7 


2637 


A 


10103 


252 


376 


RSRMGDKPIWEQIGSSFIQHYYQLFDNDRTQL 
GAIYVSFQL 


1288 


2638 


A 


10107 




478 


MEEEDESRGKTEESGEDRGDGPPDRDPTLSPS 

AFILRAIQQAVGSSLQGDLPNDKDGSRCHGL 

RWRRCRSPRSEPRSQESGGTDTATVLDMATD 

SFLAGLVSVLDPPDTWVPSRLDLRPGESEDM 

LELVAEVRIGDRDPIPLPVPSLLPRLRAWRTG 

KT 


1289 


2639 


A 


10 1 13 


237 


438 


LLSRMPSTNRAGSLKDPEIAELFFKEDPEKLFT 

DLREIGHGSFGAAYFARDVRTNEVVAIKKMS 

YSG 


1290 


2640 


A 


10114 


367 


856 


RGAKAKSAVLPPGPPCSSIL1LSPPAPLTPRSPG 

TEATRPTAMSKSLKKKSHWTSKVHESVIGRN 

PEGQLGFELKGGAENGQFPYLGEVKPGKVAY 

ESGSKLVSEELLLEVNETPVAGLTIRDVLAVI 

KHCKDPLRLKCVKQGESSGLLSVLPGGGTAR 

GAGQ 


1291 


2641 


A 


10116 


128 


591 


RTIRETERRSALSCSVLKSEPLPGLQPQASQQR 
RRRLPGRRQVQVQEGGGSGLRAWVLAMASV 
LGSGRG SGGLS S QLKCKSKJIRRRRRSKRKDK 
VSILSTFLAPFIsJiLSPGrnNTEDDDTLSTSSAE 
VKENRNVGNLAARPPPSGDRARGGATR 


1292 


2642 


A 


10121 


1 


749 


QRRRFRAGLWGGHGLTDGLRRNGGCGCSAR 

VPRVGERLRGHRCPDPLCLLLDMLFLSFHAG 

S\\TESWCCCCLIPADRPWDRGQHWQLEMADT 

RSVHETRFEAAVKVIQSLPKNG SFQPTNEMM 

LKFYSFYKQATEGPCKLSRPGFWDPIGRYKW 

DAWSSLGDMTK^EAJVnAYVEEMKKIIETMP 

MTEKVEELLRV1GPFYEIVEDKKSGRSSDITSD 
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M=Methionine, N=Asparagine, P=Proline, 
Q=Glutaminc, R=Arginine, S=Scrinc, 
T=Threonine, V=Valine, W=Jryptophan, 
Y=Tyrosine, X=Unkno\vn, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














LGNVLTSTPNAKTVNGKAESSDSGAESEEEE 
AC 


1293 


2643 


A 


10124 


2 


989 


PLMSLVRVVEFVAASSAQKTPSRLENYYMVC 

KADEKLFN QL VHFLRNHICQEKJ-1L VF F R YS S GL 

CGRGIRDSARMCSTCACVEYYGICALEVLVK 

GVKIMCIHGKMKYKRKIOFMEFRKLQSGILV 

CTDVMARGIDIPEVNWVLQYDPPSNASAFVH 

RCGRTARIGHGGSALVFLLPMEESYINFLAIN 

QKCPLQEMKPQKNTADIXPKLKSMALADRA 

VFEKGMICAFVSYVQAYAKHECNLIFRLKDL 

DFASLARGF ALLRMPKMPELRGKQFPDFVPV 

DVNTDT1PFKDKIREKQRQKLLEQQRREKTEN 

EGRRKFIKNICAWSKQKAKKK 


1294 


2644 


A 


10129 


91 


1042 


VTMYKDCIESTGDYFLLCDAEGPWGIILESLA 

JLGIVVTILLLLAFLFLMRKIQDCSQW^NVLPTQ 

LLFLLS VLGLFGLAF AFIIELNQQTAPVRYFLF 

GVLFALCFSCLLAHASNLVKLVRGCVSFSWT 

TILC1AIGCSLLQIIIATEYVTLIMTRGMMFVN 

MTPCQLN VDF VVLL V YVLFLN1ALTFF V SKAT 

FCGPCENWKQHGRLIFITVLFSIHWVVWISML 

LRGNPQFQRQPQWDDPWCIALVTNAWVFL 

LLY1VPELCILYRSCRQECPLQGNACPVTAYQ 

HSFQVENQELSRDKWKVLLNSDFLSHSGA 


1295 


2645 


A 


10133 


376 


518 


"RPRVVTHNSQWCFLPQDHPGWLPGQSGAPG 
GRG APRQEGPG S SWRQV 


1296 


2646 


A 


10135 


3 


551 


EWSLDPFMGIMSGQVGDLSPSQEKSLAQFRE 

N IQD VLS ALPNPDDYFLLRWLQARSFDLQKS 

EDMLRKHMEFRKQQDLANILAWQPPEVVRL 

YNANGICGHDGEGSPVWYHIVGSQDPKGLLL 

SASKQELLRDSFRSCELLLRECELQSQKLGKR 

VEIOIAIFGLEGLGLRDLNVKPGIELLQE 


1297 


2647 


A 


1013S 


48 


407 


MVSSCCGSVCSDQGCGQDLCQETCCRPSCCE 
TTCCRTTCCRPSCCVSSCCRPQCCQSVCCQPT 
CSRPSCCQTTCCRTTCYRPSCCVSSCCRPQCC 
QPVCCQPTCCRPSCCETTCCHPXCC 


1298 


2648 


A 


10156 


94 


453 


GGNRKSAEMFSQVPRTPASGCYYLNSMTPEG 
QEMYLRFDQTTRRSPYRMSRILARHQLVTKI 
QQEIEAKEACDWLRAAGFPQYAQLYEDSQFP 
INIVAVKNDHDFLEKDLGEPLCRRLNT 


1299 


2649 


A 


10161 


1 


393 


PRFSELVDGRGRVSARFGGSPSKAATVRSQPT 

ASAQLENMEEAPKRVSLALQLPEHGSKDIGN 

VPGNCSENPCQNGGTCVPGADAHSCDCGPGF 

KGRRCELACIKVSRPCTRLFSETKAFFVWEGG 

VCHHV 


1300 


2650 


A 


10162 


98 


391 


AK1ASLERIMP AN YTCTRPDG DNTDFRYFIY A 
VTYTGILGPGLIGNILALWVFYGYMKETKRA 
VIFMINLAIADLLQVLSLPLRIFYYLKHDWPF 

VPV 


1301 


2651 


A 


10165 


1 


7545 


" PGIRVGITSQTGLSSNLQENCSKLAFISSHGTE 
KQLQCMPMEGRGRASSSISDLQGKGFEKGTG 
PTTMVpnvn^A^RHSPOASAGGSPWORGKAQT 
R WLGKPDPGRKRRRGSPQEEGGLRV S AAAR 
LLCSGANRCKVLVRQNSTPNTQQPAVHPSTP 
PSRPLPQAGRCLVAPLRPHPDWVAAKTLAKA 
LRAPGKPWRLAAPSPLGDLGAPGLPGPSTAP 
RTLSVEEPGVECNQLCLYADVTDPVLCLGQK 
DPGVEGKHCEKEKISSSKELKHVHAKSEPSKP 
ARRLSESLH\^VDENKNESFaEREHKRRTSTPV 
IMEGVQEETDTRDVKRQVERSE1CTEEPQKQ 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: ot 
peptide 
seq- 
uence 


1 Met 
nod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIanine C=Cysteinc, 
D=Aspartic Acid, E=Glutamic Acid, 
F= Phenylalanine, G=Glycine, H=Histidine, 
I=Iso leucine, K==JLysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine. V=Valine, W=Tryptophan. 
Y=Tyrosine, X=Unknovvn, *=Stop codon, 
/=possib!e nucleotide deletion, \=possible 
nucleotide insertion 














KSTLKNEKHLKKDDSETPHLKSLLKKEVKSS 

KJEKPEREKTPSEDIO.SVKHKYKGDCMHKTG 

DETELHSSEKGLKVEENIQKQSQQTKLSSDDK 

TERKSKHRNERKLSVLGKDGKPVSEYIIKTDE 

NVRJCENNKKJil<RLS AEICl'KAEHKSRRS SDSK 

IQKDSLGSKQHGITLQRRSESYSEDKCDMDST 

NMDSNLKPEEVVHKEKRRTKSLLEEKLVLKS 

KSKTQGKQVKVVETELQEGATKQATTPKPD 

KEKNTEENDSEKQRKSKVEDKPFEETGVEPV 

LETASSSAHvSTQKDSSHRAKLPLAKEKYKSD 

KDSTSTRLERKLSDGHKSRSLKJiSSIODIKKKD 

ENKSDDKDGKEVDSSHEKARGNSSLMEKKL 

SRRLCENRRGSLSQEMAKGEEKLA.ANTLSTP 

SGSSLQRPKKSGDMTLIPEQEPMEIDSEPGVE 

NVFEVSKTQDNRNNNSHQDIDSENMKQKTS 

ATVQKDELRTCTADSKATAPAYKPGRGTGV 

NSN SE104ADHRSTL1KKMHIQSAVSKMNPGE 

KEPI HRGTTE VNIDSET VHRMLLS APSENDRV 

QKNLKNTAAEEHVAQGDATLEHSTNLDSSPS 

LSSVTVVPLRESYDPDVIPLFDKRTVLEGSTA 

STSPADHSALPNQSLTVRESEVLKTSDSKEGG 

EGFTVDTPAICASITSKRHIPEAHQATLLDGKQ 

GKVIMPLGSKLTGVIVENENirivEGGLYDMA 

KKENDLNAEPNLKQTIKATVENGKKDGIAVD 

HVVGLNTEKYAETVKLKHKRSPGKVKDISID 

VERRNENSEVDTSAGSGSAPSVLHQRNGQTE 

DVATGPRR-AEKTSVATSTEGICDKDVTLSPVK 

AGPATTTSSETRQSEVALPCTSIEADEGLIIGT 

HSRNNPLHVGAEASECTVFAAAEEGGAWTE 

GFAESETFLTSTKEGESGECAVAESEDRAADL 

LAVHAVKIEANVNSVVTEEKDDAVTSAGSEE 

KCDGSLSRDSEIVEGTITFISEVESDGAVTSAG 

TEIRAGSISSEEVDGSQGNMMRMGPKKETEG 

TVTCTGAEGRSDNFVICSVTGAGPREERMVT 

GAGVVLGDNDAPPGTSASQEGDGSVNDGTE 

GESAVTSTG1TEDGEGPASCTGSEDSSEGFAJS 

SESEENGESAMDSTVAKEGTNVPLVAAGPCD 

DEGIVTSTGAKEEDEEGEDWTSTGRGNEIGH 

ASTCTGLGEESEGVLICESAEGDSQIGTWEH 

VEAEAGAAIMNANENNNDSMSGTEKGSKDT 

DICSSAKGIVESSYTSAVSGKDEVTPVPGGCE 

GPMTSAASDQSDSQLEKVEDTTISTGLVGGS 

YDVLVSGEVPECEVAHTSPSEKEDEDIITSVE 

NEECDGLMATTASGDITNQNSLAGGKNQGK 

VLIISTSTTNDYTPQVSAITDVEGGLSDALRTE 

ENM EGTR VTTEEFE APMPS A VS GDDS QLT AS 

RSEEKDECAMISTSIGEEFELPISSA1TIKCAES 

LQPVAAAVEERATGPVLISTADFEGPMPSAPP 

EAESPLASTSKEEKDECALISTSIAEECEASVS 

GVVVESENERAGTVMEEKDGSGIISTSSVEDC 

EGPVSSAVPQEEGDPSVTPAEEMGDTAMISTS 

X otuLCA VlvlloA vL,v^lJbJjrCI * III Kvfc,LJiwoUA 

A1ISTSTAECMPISAS1DRHEENQLTADNPEGN 

GDLSATEVSKHKVPMPSLIAENNCRCPGPVR 

GGKEPGPVLAVSTEEGHNGPSVHKPSAGQGH 

PSAVCAEKEEKHGKECPEIGPFAGRGQKESTL 

HLINAEEKNVLLNSLQKEDKSPETGTAGGSST 

ASYSAGRGI .EGN ANSP AHLRGPEQTSGQTAK 

DSSVSSIRYLAAVNTGAIKADDMPPVQGTVA 

EHSFLPAEQQGSEDNLKTSTTKCITGQESKJAP 
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PCT/U SO 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine OCysteine, 
D=Aspartic Acid, E^Giutamic Acid, 
F=Phenyiaianine 3 G=Glycine, H=Histidine, 
I=Isoieucine 7 K=Lysine, L^Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=G lutaminc, R=Arginine, S^Serinc, 
T--Threonine 3 V-=Valine, W^Tryptophan, 
Y=Tyrosine, X=Unknovvn, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














SHT3VQPPATYSVALLAPKCEQDLTIKMDYSGK 

WTDQASAEKTGDDNSTRKSFPEEGDIMVTVS 

SEENVCDIGNEESPLNVLGGLKLKANLKMEA 

YVPSEEEKNGEILAPPESLCGGKPSG1AELQRE 

PLLVNESLNVENSGFRTNEEIHSESYNKGEISS 

GRKDNAEAISGHSVEADPKEVEEEERHMPKR 

KJlkQriYLSSEDEPDDNPDVLDSRIETAQRQC 

PETEPHATKEENSRDLEELPKTSSETNSTTSRV 

MEEKDEYSSSETTGEKPEQNDDDTIKSQE 


1302 


2652 


A 


10167 


321 


842 


EPSLFPFLRPSPARPPPRPPAPI-PSPELAGPEPH 

FVFYFFLSYVHPPKELAKYEYMEEQVILTEKG 

NSTVAGRGTSVRCLSPSPRPLPPLLPLLADLLE 

DGFGEHPFYHCLVAEVPKEHWTPEGNPSPFP 

EARETKCYVRSSVGCVEPLTTQAEVTENLDR 

KNSQQVFKLLKKK 


1303 


2653 


A 


10171 


206 


429 


NMILLKKRRLLINSLGEGTINGLLDELLETNV 
LSQEDTEIVKCENVTVIDKARDLLDSV1RKGA 
RACE1CITYI 


1304 


2654 


A 


10184 


970 


1524 


LCTLSPG1SGTAGSCLTTEPGTELGTSFAQNGF 

YHEAVVLFTQALKLNPQDHRLFGNRSFCHER 

LGQPAWALADAQVALTLRPGWPRGLFRLGK 

ALMGLQRFREAAAVFQETLRGGSQPDAAREL 

RSCLLHLTLQGQRGGICAPPLSPGALQPLPHA 

ELAPSGLPSLRCPRSTALRSPGLSPLLH 


1305 


2655 


A 


10194 


2 


394 


TDLLGRRFRVDGAAMAACEGRRSGALGSSQ 

SDFLTPPVGGAPWAVATTYVMYPPPPPPPHR 

DFISVTLSFGESYDNSKSWRRRSCWRKWKQL 

SRLQRNMILFLLAFLLFCGLLFYINLADHWKG 

1RNTCT 


1306 


2656 


A 


10195 


1 


410 


IPGSTrSLEGPLSKWTNVMKGWQYRWFVLDY 
NAGLLSYYTSKDKMMRGSRRGCVRJLRGAVI 
G1DDEDD STFTIT VDQKTFHFQARDADEREK 
WIHALEETILRHTLQLQVRVFTWFPDSSLVGA 
FFFWLVSGFFFK 


1307 


2657 


A 


10205 


85 


308 


QGLPSTMVKLGCSFSGKPGKDPGDQDGAAM 
DSVPLISPLDISQLQPPLPDQVVIKTQTEYQLS 
SPDQQNYTKSR 


1308 


2658 


A 


10214 


2 


453 


ECGGIRQPGPGPPPALASAPAATMNRVGGSPS 
AAANYLLCTNCRKVLRKDKRIRVSQPLTRGP 
SAFIPEKEVVQANTVDERTNFLVEEYSTSGRL 
DNITQVMSLHTQYLESFLRSQFYMLRMDGPL 
PLPYRHYIAIMAAARHQCSYLINM 


1309 


2659 


A 


10233 


45 


421 


RGWPEQQSTGRPRDVARQPRCQKEEGRRLRP 

RALESRTFQGSERSRWGPPLESTKENVQCGH 

RPAFPNSSWLPFHERLQVQNGECPWQVS1QM 

SRJKHLCGGSILHWWWVLTAAHCFRRTLLDM 

AV 


1310 


2660 


A 


10241 


243 


442 


AFQLFNAKCESAFLSKRNPLQRNWTVLYRRK 
HKJCGQSAEIQKKJ^TRRAFKFQRAITGASLADI 
MAK 


1311 


2661 


A 


10261 


751 


176 


i v>n AnvnnriHl ^1 RT FH1 T LTSAAWVPDESO 

VTLNSAICVLSTVLIMEFPDLGKHCSEKTCKQ 

LDFLPVKCDACKQDFCKDHFPYAAHKCPFAF 

QKDVHVPVCPLCNTPIPVKKGQIPDVWGDHI 

DRDCDSHPGKKKJEK1FTYRCSKEGCKKKEML 

QMVCAQCHGNFCIQHRHPLDHSCRHGSRPTI 

KAG 


1312 


2662 


A 


10270 


3 


669 


STSSDEGSPSASTPMINKTGFKFSAEKPVIEVP 
SMTILDKKJ)GEQAKALFEKVRKFRAHVEDSD 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenyJalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine. L=Leucine, 
M~Methionine, N=Asparagine, P=Proline, 
Q=Glutamine. R=Arginine, S=Serine, 
T=Threonine, V=V aline, W=Tryptophan, 
Y=Tyrosine, X=Unfcno\vn, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














LIYKLYVVQTVIKTAKFIFILCYTANFVNA1SF 

EHVCKPKVEHL1GYEVFECTHNMAYMLKKL 

LISYISI1CVYGFICLYTLFWLFRIPLKEYSFEKV 

REESSFSDIPDVKNDFAFLLHMVDQYDQLYS 

KRFGVFLSEVSENKLREISLNHEWTFEKL 


1313 


2663 


A 


10287 


1221 


266 


GA1IRVLSPAQGAQPRLRSAASVEVSMVGQR 

VLLLVAFLLSGVLLSEAAKILTISTLGGSHYLL 

LDRVSQILQEHGHNVTMLHQSGKFLIPDIKEE 

EKSYQVIRWFSPEDHQKRIKKHFDSYIETALD 

GRKESEALVKLME1FGTQCSYLLSRKDIMDSL 

KNENYDLVFVEAFDFCSFL1AEKLVKPFVAIL 

PTTFGSLDFGLPSPLSYVPVFPSLLTDHMDFW 

GRVKNFLMFFSFSRSQWDMQSTFDNTIKEHF 

PEGSRP VLSHLLLKAELWFVNSDCAFDFA RPL 

LPNTVYIGGLMEKPIKPVPQVSEPSAFSLGFT 


1314 


2664 


A 


10288 


536 


1890 


NVQLAKFSSTLVFFFSCDADPSALAKYVLAL 

VKKDKSEKELKALC1DQLDVFLQKETQIFVEK 

LFDAVNTKSYLPPPEQPSSGSLKVEFFPPQEK 

DIKKEEITKEEEREKKFSRRLNHSPPQSSSRYR 

ENRSRDERKKDDRSRKRDYDRNPPRRDSYRJD 

R YNRRRGRSRS Y SRSRSRS W SKERLRERDRD 

RSRTRSRSRTRSRERDLVICPKYDLDRTDPLEN 

WTPVSSVPSISSGHYPVPTLSSTITVIAPTHHG 

NNTTESWSEFHEDQVDHNSYVRPPMPKKRC 

RDYDEKGFCMRGDMCPFDHGSDPVVVEDVN 

LPGMQPFPAQPPVVEGPPPPGLPPPPP1LTPPPV 

NLRPPVPPPGPLPPSLPPVTGPPPPLPPLQPSG 

MDAPPNSATSSVPTVVTTGIHHQPPPAPPSLFT 

ADTYDTDGYNPEAPSITNTSRPMYRHRVHPR 

AI<CLG 


1315 


2665 


A 


10293 


447 


1331 


SHPLLSCPEKVSAKLRAAAEAAAEERRTRGA 

GSRGICAGLRSVAPGPEPLKQEEGRREWGSSI 

GTPSPCGSAQAAAAAAAEEATEKfPALRPALL 

W ALLAL WLCCATPAHALQCRDG YEPCVNEG 

MCVTYHNGTGYCKCPEGFLGEYCQHRDPCE 

KNRCQNGGTCVAQAMLGICATCRCASGFTGE 

DCQYSTSHPCFVSRPCLNGGTCHMLSRDTYE 

CTCQ VGFTGKNPKCPG GNLNYQFNG 1 1 VVYS 

GGSVPPSGTKTSKPAEHNAMGTGSKNFASGT 

LWVMVSGATSTSTSTL 


1316 


2666 


A 


10294 


118 


572 


SLSMESNHK5GDGLSGTQKEAALRALVQRTG 

YSLVQENGQRKYGGPPPGWDAAPPERGCEIFI 

GKLPRDLFEDELIPLCEKIGKIYEMRMMMDF 

NGNNRGYAFVTFSNKVEAKLNAIKQLNNYE1R 

NGRLLGVCASVDNCRLFVGGIPFCTKK 


1317 


2667 


A 


10301 


158 


1956 


LLKSCGVLLSGVCIPCEGKGPTVLVIQTAVPQ 

DRPTKSSMRSAAKPWNPAJRAGGHGPDRVRP 

LPAASSGMKSSKSSTSLAFESRLSRLKRASSE 

DTLNKPGSTAASGVVRLKKTATAGAISELTES 

RLRSGTGAFTTTKRTGIPAPREFSVTVSRERSV 

PRGPSNPRKSVSSPTSSNTPTPTKHLRTPSTKP 

KQENEGGEKAALESQVRELLAEAKAKDSEIN 

RLRSELKKYKEKRTLNAEGTDALGPNVDGTS 

VSPGDTEPMIRALEEKNKNFQKJELSDLEEENR 

VLICEKLlYLEHSPNSEGAASHI'GDSSCPTSrrQ 

ESSFGSPTGNQLSSDIDEYKKNIHGNALRTSG 

SSSSDVTKASLSPDASDFEHITAETPSRPLSSTS 

NPFKSSKCSTAGSSPNSVSELSLASLTEKIQKM 

EENHHSTAEELQATLQELSDQQQMVQELTAE 
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PCT/U SO 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence j 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, HNHistidine, 
I=Iso leucine, K^Lysine, L=Leucine, 
M=Methioninc, N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S-Serine, 
T=Threonine, V^Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














NEKLVDEKTILETSFHQHRERAEQLSQENEKL 

MNLLQERVKNEEPTTQEGKIIELEQKCTGILE 

QGRFEREKI./LNIQQQLTCSLRKVEEENQGAL 

EMlKRLKEENEKLNEFLELElOiNNNMMAKTL 

EECRVTLEGLKMENGSLKSHLQG 


1318 


2668 


A 


10303 


333 


879 


GECFIMAAVVQQNDLVFEFASNVMEDERQL 

GDPAIFPAVIVEHVPGADILNSYAGLACVEEP 

NDMITESSLDVAEEEIIDDDDDDITLTVEASCH 

DGDETIETIEAAEALLNMDSPGPMLDEKIUNN 

NIFSSPEDDMVVAPVTHVSVTLDG1PEVMETQ 

QVQEKYADSPGASSPEQPKRKKK 


1319 


2669 


A 


10322 


169 


654 


MEVRMSGSVAVTRAIAVPGLLLLLIIATALSL 

LIGAKSLPASWLEAFSGTCQSADCTIVLDAR 

LPRTLAGLLAGGALGLAGALMQTLTRNPLAD 

PGLLGVNAGASFAIVLGAALFGYSSAQEQLA 

MAFAGALVASLIVAFTGSQGGGQLSPVRLTL 

AGVXL 


1320 


2670 


A 


10323 


441 


2 


KMNQVAVVIGGGQTLGAFLCHGLAAEGYRV 

AVVDIQSDKAANVAQEINAEYGESMAYGFG 

ADATSEQSVLALSRGVDEIFGRVDLLVYSAGI 

AKAAFISDFQLGDFDRSLQVNLVGYFLCARE 

FSRLMIRDGIQGRIIQINSKSDE 


1321 


2671 


A 


10332 


1 


453 


RHRTAGPGSTISSRTDSASAPAARAMPCEYTY 

AKLTSDCSRPSLQWYTRAQSKMRRPRLLLKD 

ILKCTLLVFGVRILYILKLNYTTEECDMECNMH 

YVDPDHVKRAQKYAQQVLQKESPPKFAKTS 

MALLFEHRYSVDLLPFVQKAPTDSEA 


1322 


2672 


A 


10333 


25 


423 


EPSNGPVVYSALGNEDDE1LLLGKDIIGTFAAS 

ERKMRAHQVLTFLLLFVITSGASENASTSRGC 

GLDLLPQNVYLCDLDAIWGTWEAVAGAGA 

LITLLLMLILLGRLPF1KEKEKKSPAVLHFLFL 

LGTLG 


1323 


2673 


A 


10334 


52 


426 


SSLGNEDDE1LSLAKDITGMFVASFIRKMRAH 
QVLTFLLLFVITSVASENASTSRGCGLDLLPQ 
YVS LCDLDAI WGI V VE AAAG AG AL rTLLLMLI 
LLVREPFFKEKEKKSPVGLHFLFLLGTLGP 


1324 


2674 


A 


10336 


1 


932 


ERLCFPCMQSKIYSYMSPNKCSGMRFPLQEE 

NSVTHHEVKCQGKPLAGIYRKREEKRNAGN 

AVRSAMKSEEQKIKDARKGPLVPFPNQKSEA 

AEPPKTPPS SCD STN AAI AKQ ALKKPI KGKQ A 

PRKKAQGKTQQNRKLTDFYP VRRS SRKSKAE 

LQSEERKRJDEL1ESGKEEGMKIDLIDGKGRG 

VIATKQFSRGDFV\^YHGDLIEITDAKKREAL 

YAQDPSTGCYMYYFQYLSKTYCVDATRETN 

RLGRLINHSKCGNCQTKLHDIDGVPHLILIAS 

RDIAAGEELLYDYGDRSKASIEAHPWLKH 


1325 


2675 


A 


10338 


3 


870 


PGSTISCSELKGTQCRATAGSRGRRPPMTCWL 

RGVTATFGRPAEWPGYLSHLCGRSAAMDLG 

PMRKSYRGDREAFEETHLTSLDPVKQFAAWF 

EEAVQCPDIGEANAMCLATCTRDGKPSARML 

t i T^r , T?r^i<rr>fTRl?FFTKrFFSRKnKFLDSNPFASL 

VFYWEPLNRQVRVEGPVKKLPEEEAECYFHS 

RPKS SQ1GA VV SHQS S VIPDREYLRKKNEELE 

QLYQDQEVPKPKSWGGYVLYPQVMEFWQG 

QTNRLHDRIVFRRGLPTGDSPLGPMTTIRGEE 

DWLYERLAP 


1326 


2676 


A 


10344 


2 


984 


ARAAAHCGTCRLVRWWRKRRSVMGIQTSPV 
LLASLGVGLV1IXGLAVGSYLVRRSRRPQVT 
LLDPNEKYLLRLLDKTT V SHNTKRFRF ALPTA 



327 



MSDOCID- <WO 0157188A2 I > 



WO 01/57188 




T/U SO 1/03800 



* SEQ ID 
NO; of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO; of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO; 
in 

USSN 
09/496 
914 


Predicted . 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, 
D=Asparlic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H^Histidine, 
I=Isoleucine, K=Lysine 5 L=Leucine, 
M=Methionine, N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine s 
T=Thrconinc. V=Valinc, W=Tryptophan, 
Y=Tyrosine, X=Unkno\vn, *=Siop codon, 
/^possible nucleotide deletion, \=possib!e 
nucleotide insertion 














HHTLGLPVGKHIYLSTRIDGSLVIRPYTPVTSD 

EDQGYVDLVIKVYLKGVHPKFPEGGKMSQY 

LDSLKVGDVVEFRGPSGLLTYTGKGHFNIQP 

NKJCSPPEPRVAKKLGMIAGGTGITPMLQLIRA 

ILKVPEDPTQCFLLFANQTEKDULREDLEELQ 

ARYPNRFKLWFTLDHPPKDWAYSKGFVTAD 

MIREHLPAPGDDVLVLLCGPPPMVQLACHPN 

LDKLGYSQKMRFTY 


1327 


2677 


A 


10345 


1 


968 


LQSAGEGVTHVLILLESPARPVAAVTQVQRR 

RYHR1..SDMSMLAERRKKQKWAVDPQNTAW 

SNDDSK.FGQRMLEKMGWSKGKGLGAQEQG 

ATDHIKVQVKKNHLGLGATINNEDNWIAHQ 

DDFNQLLAELNTCHGQETTDSSDKKJEKKSFS 

LEEKSKISKNKVHYMKFTKGKDLSSRSKTDL 

DCIFGKRQSKKTPEGDASPSTPEENETTTTSAF 

TI QE YF AKRMA ALICNKPQ VP VPGSDI SETQ VE 

RJ<i<GKKRNKEATGKDVESYLQPKAiCRHTEG 

ICPERAEAQERVAKKKSAPAEEQLRGPCWDQ 

S SKAS AQD AGDHVQPA 


1328 


2678 


A 


10346 


173 


439 


GSAAMICVKIKCWNGVATWLWVANDENCGI 
CRMAFNGCCPDCKVPGDDCPLVWGQCSHCF 
HMHCILKWLHAQQVQQHCPMCRQEWKJ^KE 


1329 


2679 


A 


10351 


3 


964 


QMEPGNDTQ1SEFLLLGFSQEPGLQPFLFGLFL 

SMYLVTVLGNLLIILATISDSHLHTPMYFFLSN 

LSFADJCVTSTTIPKMLMNIQTQNKV7TY1ACL 

MQMYFFILFAGFENFLLSVMAYDRFVAICHP 

LHYM VIMNPHLCGLL VL A S WTMS AL YSLLQI 

LMVVRLSFCTALEIPHFFCELNQV1QLACSDSF 

LNHMVIYFTVALLGGGPLTGILYSYSKIISSIH 

AISSAQGKYKAFSTCASHLSVVSLFYGAILGV 

YLSSAATRNSHSSATASVMYTVVTPMLNPFI 

YSLRNKDIKRALGIHLLWGTMKGQFFKKCP 


1330 


2680 


A 


10352 


34 

J 


2573 


IPFLKSCCCCCLFDFPPPPLDQVQEEECEVERV 

TEHGTPKPFRKFDS VAFGESQSEDEQFENDLE 

TDPPNWQQLVSREVLLGLKPCEIKRQEVINEL 

FYTERAHVRTLKVLDQVFYQRVSREGJLSPSE 

LRKIFSNLEDILQLHIGLNEQMKAVRKRNETS 

VIDQIGEDLLTWFSGPGEEKLKFIAAATFCSNQ 

PFALEMIKSRQKKDSRFQTFVQDAESNPLCRR 

LQLKDIIPTQMQRLTKYPLLLDNIATYTEWPT 

EREKVKK/\ADHCRQILNYVNQAVKEAENKQ 

RLEDYQRRLDTSSLKLSEYPNVEELRNLDLTK 

RKMIHEGPL VWK VNRDKTIDL YTLLLEDIL V 

LLQKQDDRLVLRCHSKILASTADSKHTFSPVI 

KLSTVLVRQVATDNICALFVISMSDNGAQIYE 

LVAQTVSEKTVWQDLICRMAASVKEQSTKPI 

PLPQSTPGEGDNDEEDPSKLKEEQHGISVTGL 

QSPDRDLGLESTLISSKPQSHSLSTSGKSEVRD 

LFVAERQFAKEQHTDGTLKEVGEDYQ1AIPDS 

HLPVSEERWALDALRNLGUXQLLVQQLGLT 

EKSVQEDWQHFPRYKTASQGPQTDSVJQNSE 

NIKAYHSGEGHMPFRTGTGDIATCYSPRTSTE 

SFAPRDSVGLAPQDSQASNILVMDHMIMTPE 

MPTMEPEGGLDDSGEHFFDAREAHSDENPSE 

GDGAVNKEEKDVNLRISGNYLILDGYDPVQE 

SSTDEEVASSLTLQPMTGIPAVESTHQQQHSP 

QNTHSDGAISPFTPEFLVQQRWGAMEYSCFEI 

QSPSSCADSQSQIMEYIHKIEADLEHLKKVEE 

SYTILCQRLAGSALTDKHSDKS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end * 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A-Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine, K==Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P-Proline, 
Q=Glutaminc, R=Arginine, S=Serine, 
T=Threonine, V-Valine, W^Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion. \=possible 
nucleotide insertion 


1331 


2681 


A 


10353 


1 


2100 


AVEFAEGALTMAPWPELGDAQPNPDKYLEG 

AAGQQPTAPDKSKETNKTDNTE.APVTKIELLP 

SYSTATLIDEPTEVDDPWNLPTLQDSGIKWSE 

RDTKGKILCFFQGIGRLILLLGFLYFFVCSLDIL 

SSAFQLVGGKMAGQFFSNSSIMSNPLLGLVIG 

VLVTVLVQSSSTSTSIVVSMVSSSLLTVRAAIP 

IIMGANIGTSITNT1VALMQVGDRSEFRRAFA 

GATVHDFFNWLSVLVLLPVEVATHYLEIITQL 

I VESFHFKNGEDAPDLLKVITKPFTKI ,TVQLDK 

KVISQ1 AMNDEKAKNKSLVKI WCKTFTN KTQ 

IN VT VPSTANCTSPSLCWTDGIQN WTMKNVT 

YKENIAKCQH1FVNFHLPDLAVGTILLILSLLV 

LCGCLIMIVKILGSVLKGQVATVIKKTINTDFP 

FPFAWLTGYLAILVGAGMTFIVQSSSVFTSAL 

TPLIGIGVITIERAYPLTLGSNIGTTTTAILAAL 

ASPGNALRSSLQIALCHFFFNISGILLWYP1PFT 

RLPIRMAKGLGNISAKYRWFAVFYLIIFFFLIP 

LTVFGLSLAGWRVLVGVGVPVVFII1LVLCLR 

LLQSRCPRVLPKKLQN^'NFLPLWMRSLKPW 

DAVVSKFTGCFQMRCCCCCRVCCRACCLLC 

GCPKCCRCSKCCEDLEEAQEGQDVPVKAPET 

FDNITISREAQGEVPASDSKTECTAL 


1332 


2682 


A 


10354 


30 


1377 


"SQQGSQPHRQGPPSLLTAPHSLDLPALPPGPR 
GSQGKLRRVLVPMSVKPSWGPGPSEGVTAVP 
TSDLGEIHNWTELLDLFNHTLSECHVELSQST 
KRV VLF AL YL AMF V VGL VENLL VICVN WRG 
SGRAGLMNL YILNMAI ADLGTVLSLPV WMLE 
VTLDYTWLWGSFSCRFTHYFYFVNMYSSIFF 
LVCLSVDRYVTLTSASPSWQRYQHRVRRAM 
CAG1WVLSAIIPLPEVVHIQLVEGPEPMCLFM 
APFETYSTWALAVALSTTILGFLLPFPL1TVFN 
VLTACRLRQPGQPKSRRHCLLLCAYVAVFV 
MCWLPYHVTLLLLTEHGTHISLHCHLVHLLY 
FF YD VI DCF SMLHC VINP1L YNFL SPHFRGRLL 
NAVVHYLPKDQTKAGTCASSSSCSTQHSinT 
KGDSQPAAAAPHPEPSLSFQAHHLLPNTSPISP 
TQPLTPS 


1333 


2683 


A 


10358 


2 


884 


AAGAGADGREPASERASRAEPPAVAMGQND 

LMGTAEDFADQFLRVTKQYLPHVARLCLIST 

FLEDGIRMWFQWSEQRDYIDTTWNCGYLLA 

SSFVFLNLLGQLTGCVLVLSRNFVQYACFGLF 

GI1ALQT1AYSILWDLKFLMRNLALGGGLLLL 

LAESRSEGKSMFAGVPTMRESSPKQYMQLGG 

RVLLVLMFMTLLHFDASFFSIVQNIVGTALMI 

L V Al GFKTKL AALTLV Y WLFAINVYFN AF WT 

1PVYKPMHDFLKYDFFQTMSVIGGLLLVVAL 

GPGGVSMDEKKKEW 


1334 


2684 


A 


10367 


59 


1562 


QAWSLQVALSPFFFPASPSNSFAAAVPQLLFP 

ELPLPHVPGQESAKRRSARRFLIMSELTKELM 

ELVWGTKSSPGLSDTIFCRWTQGFVFSESEGS 

ALEQFEGGPCAVIAPVQAFLLKKLLFSSEKSS 

\x/Drkr , QnFPnivFr \ CHTI CDILESACCDHSGS 

YCLVSWLRGKTTEETASISGSPAESSCQVEHS 

SALAVEELGFERFHALIQKRSFRSLPELKDAV 

LDQYSMWGNKFGVLLFLYSVLLTKGIENTKN 

EIEDASEPLIDPVYGHGSQSLINLLLTGHAVSN 

VWDGDRECSGMKLLGIHEQAAVGFLTLMEA 

LRYCKVGSYLK1SKIPYLDCLASETHLTVFFA 

ICDMALVAPEAPSEQARRVFQTYDPEDNGFIP 

DSLLED VMKALDLV SDPEYINLMKNKLDPEG 
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sISDOCtD <WO 0157188A2J_> 



WO 01/57188 




T/US01/03800 



SEQ1D 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
• uence 


Met 
hod 


SEQ 
ir> no- 

XLJ in \j . 

in 

USSN 
09/496 
914 


Predicted 

kon inni'no 
DCt^ll lllllltd, 

nucleotide 
location 
enrrt^nond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucieoiiuc 
location 
corresponding 

tit la<it aminn 

acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D = Aspartic Acid ; E^GIutamic Acid, 
F=PhenyIalanine s G=Clycine s H=IIistidine, 
Msoleucine, K=Lysine. D=Leucine, 
ivi ivicuiiuiiiiic, in — /A-bpai agiiic, r — i roiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T='i hreonine 5 V= Valine, W-'i ryptophan, 
Y=Tyrosine, X-Unknown. *=Stop codon, 
/=possiblc nucleotide deletion, V=possible 
nucleotide insertion 














l.GTRJXiPFLQEFFPDQGSSGPESFTVYHYNGL 
KQSN YNEK VM Y V EG TAW MG FEDPMLQTD 
DTPIKRCLQTK WP YIELL WTTDRSPSLN 


1335 


2685 


A 


10375 


82 


2929 


TRTKRRLGREKAMASPPRGWGCGELLLPFML 

LGTLCEPGSGQIRYSMPEELDKGSFVGNIAKD 

LGLEPQET AER G VRI VSR GRTQLF ALNPRSGS 

LVTAGRIDREELCAQSPLCVVNFNILVENKM 

KIYGVEVEIIDINDNFPRFRDEELKVKVNENA 

AAGTRLVLPFARDADVGVNSLRSYQLSSNLH 

FSLDVVSGTDGQKYPELVLEQPLDREKETVH 

DLLLTALDGGDPVLSGTTHIRVTVLDANDNA 

PLFTPSEYSVSVPENIPVGTRLLMLTATDPDE 

G 1NGKXT Y S FRN EEEKJS ETFQLDSN LGE1 STL 

QSLDYEESRFYLMEVVAQDGGALVASAKVV 

VT^^QDVNDNAPEVILTSLTSSISEDCLPGTVIA 

LFSVHDGDSGENGE1ACSIPRNLPFKLEKSVD 

NYYHLLTTRDLDREETSDYNITLTVMDHGTP 

PLSTESHIPLK1VADVNDNPPNFPQASYSTSVT 

ENNPRGVS1FSVTAHDPDSGDNARVTYSLAE 

DTFQGAPLSSYVSINSDTGVLYALRSFDYEQL 

RDLQLWVTASDSGNPPLSSNVSLSLFVLDQN 

DNTPEILYPALPTDGSTGVELAPRSAEPGYLV 

TKVVAVDKDSGQNAWLSYRLLKASEPGLFA 

VGLHTGEVRTARALLDRDALKQSLVVAVED 

HGQPPLSATFTVTVAVADRIPDILADLGS1KTP 

l\Jrh,ULL>L 1 L > L V VA VAAVbCVrLAF VIVLLV 

LRLRRWHKSRLLQAEGSRLAGVPASHFVGV 

DGVRAFLQTYSHEVSLTADSRKSHLIFPQPNY 

ADTLLSEESCEKSEPLLM S DK VD ANKEERR V 

QQ APPNTD WRFSQ AQRPGTSGSQNG DDTGT 

WPNNQFDTEMLQ AMILAS ASE AADG S STLGG 

QNVYIPGSNATLTNAAGKRDGKAPAGGNGN 
KKKSGKKEKK 


1336 


2686 


A 


10379 


I 


557 


RPRRRQPSFS CR VL VLEDPPCFRFTNS MNQEK 
LAKLQAQVRJGGKGTARRKKXVVHRTATAD 
DKKLQSSLKKLAVNNIAGIEEVNMIKDDGTVI 
HFNNPKVQASLSANTFA1TGHAEAKPITEMLP 
GILSQLGADSLTSLRKLAEQFPRQVLDSKAPK 
PEDIDEEDDDVPDLVENFDEASKNEAN 


1337 


2687 


A 


10380 


1 


1263 


IPGSTISWSPAAARGLSVCRCCRLHPASAMDL 

FGDLPEPERSPRPAAGKEAQKGPLLFDDLPPA 

SSTDSGSGGPLLFDDLPPASSGDSGSLATSISQ 

MVKTEGKGAKRKTSEEEKNGSEELVEKKVC 

KASSVIFGLKGYVAERKGEREEMQDAHVILN 

AAQNLHQNLIRKFPKGDVISVEKTVKRCLLD 

TFKHTDEEFLKQASSQKPAWKJ3GSTATCVLA 

VDNILYIANLGDSRAILCRYNEESQKHAALSL 

SKEHNPTQYEERMRIQKAGGNVRDGRVLGV 

LEVSRS1GDGQYKRCGVTSVPDIRRCQLTPND 

RFILLACDGLFKVFTPFEAVNFN SPI FDFTCfO 

TREGKSAADARYEAACNRLANKAVQRGSAD 

NVTVMVVRIGH 


1338 


2688 


A 


10385 


3 


589 


GPSQSMAAGELEGGKPLSGLLNALAQDTFHG 

YPGITEELLRSQLYPEVPPEEFRPFLAKMRGIL 

KSLASADlVtDFNQLEAFLTAQTKKQGGITSDQ 

AAVISKFWKSHKTKJRESLMNQSRWNSGLRG 

LSWRVDGKSQSRHSAQ1HTPVAI1ELELGKYG 

QESEFLCLEFDEVKVNQILKTLSEVEESISTL1S 
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SDOCID: <WO 0157188A2_L> 



WO 01/57188 




PCT/U SO 1/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
Jr—rnenyiaianine, u— vjiycine, n xu^uuiiiL., 
l=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

<t TK ^anninn \/ — \/'olinf» W=TYvntr»nhan 

1 — i nreonine, v — vdiine, w ii^piupnoii, 

Y=Tyrosine. X=Unknown s *=Stop codon, 

/=possible nucleotide deletion, V=possible 

nucleotide insertion 














QPN 


1339 


2689 


A 


10386 


50 


390 


LGAMAKHHPDLIFCRKQAGVA1GRLCEKCDG 
KCVICDSYVRPCTLVRICDECNYGSYQGRCV1 
CGGPGVSDAYYCKECTIQEKDRDGCPKIVNL 
GSSKTDLFYERKKYGFKKR 


1340 


2690 


A 


10388 


113 


3472 


SQLRKGASA1 HSSPSR1DCIAQMMD1 YVCLK 

RPSWMVDNKRMRTASNFQWLLSTFILLYLM 

NQVNSQKKGAPHDLKCVTKNLQVWNCSWK 

APSGTGRGTDYEVCIENRSRSCYQLEKTSIKIP 

ALSHGDYEITINSLHDFGSSTSKFTLNEQNYSL 

IPDTPEILNLSADFSTSTLYLKWNDRGSVFPHR 

SNV1WE1KVLRKESMELVKLVTHNTTLNGKD 

TLHHWSWASDMPLECAIHFVEIRCYIDNLHFS 

GLEEWSDWSPVKNISWIPDSQTKVFPQDKVIL 

VGSDITFCCVSQEKVLSAL1GHTNCPLIHLDGE 

NVAIKIRNISVSASSGTNVVFTTEDNIFGTVIF 

AGYPPDTPQQLNCETHDLKEIICSWNPGRVTA 

LVGPRATSYTLVESFSGKYVRLKRAEAPTNES 

YQLLFQMLPNQEIYNFTLNAHNPLGRSQSTIL 

VNITEKVYPHTPTSFKVKDINSTAVKLSWHLP 

GNFAKINFLCEIEIKKSNSVQEQRNVTIKGVE 

NSSYLVALDKLNPYTLYTFRIRCSTETFWKW 

SKWSNKKQHLTTEASPSKGPDTWREWSSDG 

KNLIIYWKPLPINEANGKILSYNVSCSSDEETQ 

SLSEIPDPQHKAE1RLDKND YIIS V V AKN S VGS 

SPPSKIASMEIPNDDLKIEQWGMGKGILLTW 

HYDPNMTCDYVIKWCNSSRSEPCLMDWRKV 

PSNSTETVIESDEFRPGIRYNFFLYGCRNQGY 

QLLRSMIGYIEELAPIVAPNFTVEDTSADSILV 

KWEDIPVEELRGFLRGYLFYFGKGERDTSIvM 

RVLESGRSDIKVKNITDISQKTLRIADLQGKTS 

YHLVLRAYTDGGVGPEKSMYVVTKENSVGL 

IIAILIPVAVAVIVGV V I MLC YKlvKk w iKi, n i 

PDIPNPENCKALQFQKSVCEGSSALKTLEMNP 

CTPNNVEVLETRSAFPKIEDTEIVSPVAERPEN 

RSD AKPENH WES YCPPIIEEEIPNP AADETGG 

TAQVIY1DVQSMYQPQAKPEEEQENDPVGGA 

GYKPQMHLPINSTVEDIAAEEDLDKTAGYRP 

QANVNTWNLVSPDSPRSIDSNSE1VSFGSPCSI 

NSRQFLIPPKDEDSPKSNGGGWSFTNFFQNKP 

ND 


1341 


2691 


A 


10392 


1 


5057 


MLPPKHLSATKPKKSWAPNLYELDSDL'IKEP 

DVnGEGPTDSEFFHQRFRNLIYVEFVGPRKTL 

n<LRKLCLDWLQPETRTKEEIIELLVLEQYLTH 

PEKLKPWVRAKKPENCEKLVTLLENYKEMY 

QPEGESLHGVLVVSAGLRCPLGLSASTLLTW 

SGLDNSLSWAAVGMSCVLWDIELHHDFLGV 

ATKS V STHAQGDAAQGLGGTIVRM W ARDSN 

L ATG VLLDDNNSDVTSDDDMTRNRRES SPPH 

SVHSFSGDRDWDRRGRSRDTEPRDRWSI1TR 

NPRSRMPPRDLSLPVVAKTSFEMDREDDRDS 

RAYESRSQD AESYQN WDLAEDRKPHNTIQD 

NMENYRKJXSLGVQLAEDDGHSHMTQGHSS 

RSKRSAYPSTSRGLKTMPEAKKSTHRRGICED 

ESSHGVIMEKFIKDVSRSSKSGRARESSDRSQ 

RFPRMSDDNWKDISLNKRESVIQQRVYEGNA 

FRGGFRFNSTLVSRKRVLERKRRYHFDTDGK 

GSIHDQKGCPRKKPFECGSEMRKAMSVSSLS 

SLSSPSFTESQPIDFGAMPYVCDECGRSFSVIS 

EFVEHQIMHTRENLYEYGESFIHSVAVSEVQK 
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SBQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine O-Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenyl alanine.. G=GIycine, H^Histidine, 
I=lsoleucine, K=Lysine, I^Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutaminc, R=Argininc. S=Scrinc, 
T=Threonine, V= Valine. W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














SQVGGKRPECKDCGETFNKSAALAEHRKIHA 

RG YL VEC KNQECEE AFMPSPTFSELQKI YGK 

DKFYECRVCKETFLHSSALIEHQKIHFGDDKD 

NER.EHERERERERGETFRPSPALNEFQICMYG 

KEKMYECKVCGETFLHSSSLKEHQKIHTRGN 

PFENKGKVCEETFIPGQSLKRRQKTYNKEKLC 

DFTDGRDAFMQSSELSEHQKIHSRKNLFEGR 

GYEKSVIHSGPFTESQKSHTITRPLESDEDEKA 

FTISSNPYENQKIPTKENVYEAKSYERSVIHSL 

ASVEAQKSHSVAGPSKPKVMAESTIQSFDAIN 

HQRVRAGGNTSEGREYSRSVIHSLVASKPPRS 

HNGNELVESNEKGESSIYISDLNDKRQKIPAR 

ENPCEGGSKNRNYEDSVIQSVFRAKPQKSVP 

GEGSGEFKKDGEFSVPSSNVREYQKARAKKK 

YIEHRSNETSVIHSLPFGEQTFRPRGMLYECQ 

ECGECFAHSSDLTEHQKIHDREKPSGSRNYE 

WSVIRSLAPTDPQ'ISYAQEQYAKEQARNKCK 

DFRQFFATSEDLNTNQKIYDQEKSHGEESQGE 

NTDGEETHSEETHGQETIEDPVIQGSDMEDPQ 

KDDPDDKIYECEDCGLGFVDLTDLTDHQKVH 

SRKCLVDSREYTHSVIHTHSISEYQRDYTGEQ 

LYECPKCGESFIHSSFLFEHQRJHEQDQLYSM 

KGCDDGFIALLPMKPRRNRAAERNPALAGSA 

IRCLLCGQGF1HSSALNEHMRLHREDDLLEQS 

QMAEEAIIPGLALTEFQRSQTEERLFECAVCG 

ESFVNPAELADHVTVHKNEPYEYGSSYTHTS 

FLTEPLKGAIPFYECKDCGKSFIHSTVLTKHKE 

LHLEEEEEDEAAAAAAAAAQEVEANVHVPQ 

WLRIQGLNVBAAEPEVEAAEPEVEAAEPEV 

EAAEPN G E AEGPDGEAAEPIGE AG QPNGEAE 

QPNGDADEPDGAGIEDPEERAEEPEGKAEEPE 

GDADEPDGVGIEDPEEGEDQEIQVEEPYYDC 

HECTETFTSSTAFSEHLKTHASVniFEPANAFG 

ECSGYIERASTSTGGANQADEKYFKCDVCGQ 

LFNDHLSLARHQNTHTG 


1342 


2692 


A 


10393 


2 


1350 


GRPRSSSDNRNFLRERAGLSSAAVQTRIGNSA 

ASRRSPAARPPVPAPPALPRGRPGTEGSTSLS 

APAVLVVAVAVVVVVVSAVAWAMANYIHV 

PPGSPEVPKLNVTVQDQEEHRCREGALSLLQ 

HLRPHWDPQEVTLQLFTOGITNKLIGCYVGN 

TMEDWLVRIYGNKTELLVDRDEEVKSFRVL 

QAHGCAPQLYCTFNNGLCYEFIQGEALDPKH 

VCNPAIFRLLARQLAKIHAIHAHNGWIPKSNL 

WLKMGKYFSLIPTGFADEDINKRFLSDIPSSQI 

LQEEMTWMKEILSNLGSPVVLCHNDLLCKNII 

YNEKQGDVQFIDYEYSGYNYLAYDIGNHFNE 

FAGVSDVDYSLYPDRELQSQWLRAYLEAYK 

EFKGFGTEVTEKEVEILFIQVNQFALASHFFW 

GLWALIQAKYSTIEFDFLGYAIVRFNQYFKM 

KPEVTALKVPE 


1343 


2693 


A 


10394 


102 


839 


PEAQTSAVLAilEKGHLPTMRHEAPMQMASA 

TSFLFRYADDSFTSAFVSTVGIDFKVKTVFKN 

EKJIIKLQrWDTAGQERYRTITTAYYRGAMGFI 

LMYDITNEESFNAVQDWSTQIKTYSWDNAQ 

ViLVGNKCDMEDERVISTERGQHLGEQLGFE 

FFETSAKDNINVKQTFERLVDIICDKMSESLET 

DPAITAAKQNTRLKETPPPPQPNCAC 


1344 


2694 


A 


10395 


2 


4136 


DRPPWNSRVDDFVTNLIHLSSKGHrSPAKDTS 
LQQRTPAEMSP\ r LHFYVRPSGHEGAASGHTR 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond i 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIanine C=Cysteine s 
D=Aspartic Acid, E=Giutamic Acid, 
F=Phenylalanine, G=G}yc\ne, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q-Glutamine, R-=Arginine, S=Serine, 
T=Threonine 3 V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














RKLQGKLPELQGVETELCYNVNWTAEALPSA 

EETKKLMWLFGCPLLLDDVARESWLLPGSN 

DLLLEVGPRLNFSTPTSTNIVSVCRATGLGPV 

DRVETTRRYRLSFAHPPSAEVEATALATLHDR 

MTEQHFPHPIQSFSPESMPEPLNGPIN 1LGEGR 

LALEKANQELGLALDSWDLDFYTKRFQELQR 

NPST VEAFDLAQSN SEHSRH WFFKGQLH VDG 

QICLVFISLFESIMSTQESSNPNNVLKFCDNSSA 

IQGKEVRFLRPEDPTRPSRFQQQQGLRHVVFT 

AETHNFPTGVCPFSGATTGTGGRIRDVQCTG 

RGAHVVAGTAGYCFGNLHIPGYNLPWEDLSF 

QYPGNFARPLEVAIEASNGASDYGNKFGEPV 

LAGFARSLGLQLPDGQRREWIKPIMFSGGIGS 

MEADHISKEAPEPGMEVVKVGGPVYR1GVGG 

GAASSVQVQGDNTSDLDFGAVQRGDPEMEQ 

KMNRVIRACVEAPKGNPJCSLHDQGAGGNG 

NVLKJELSDPAGAIIYTSRFQLGDPTLNALEIW 

GAEYQESNALLLRSPNRDFLTHVSARERCPA 

CFYGTITGDRRIVLVDDRECPVRRNGQGDAP 

PTPPPTPVDLELEWVLGKMPRKEFFLQRKPP 

MLQPLALPPGLSVHQALERVLRLPAVASKRY 

LTNKVDRSVGGLVAQQQCVGPLQTPLADVA 

WALSHEELIGAATALGEQPVKSLLDPKVAA 

RLAVAEALTNLVFALVTDLRDVKCSGNWM 

WAAKLPGEGAALADACEAMVAVMAALGVA 

VDGGKDSLSMAARVGTETVRAPGSLVISAYA 

VCPDITATVTPDLKHPEGRGHLLYVALSPGQ 

HRJLGGTALAQCFSQLGEHPPDLDLPENLVRA 

FSrTQGLLKDRLLCSGHDVSDGGLVTCLLEM 

AFAGN CGLQ V DYP VPRVD VLSVLFAEEPGL V 

LE VQEPD L AQ VLKR YRD AGLHCLELGHTGE 

AGPHAMVRVSVNGAVVLEEPVGELRALWEE 

TSFQLDRLQAEPRCVAEEERGLRERMGPSYC 

LPPTFPKAS VPREPG GPSPR VAILREEGSNGDR 

EMADAFHLAGFEVWDVTMQDLCSGAIGLDT 

FRGVAF VGGFS YADVLGSAKG W AAAVTFHP 

RAGAELRRFRKJRPDTFSLGVCNGCQLLALLG 

WVGGDPNEDAAEMGPDSQPARPGLLLRHNL 

SGRYESRWASVRVGPGPALMLRGMEGAVLP 

V W S AHGEG Y V AF S SPELQ AQIEARGL APLH W 

ADDDGNPTEQYPLNPNGSPGGVAGICSCDGR 

HLAVMPHPERAVRPWQWAWRPPPFDTLTTS 

PWLQLFINARNWTLEGSC 


1345 


2695 


A 


10396 


65 


642 


GVRGFWAGTMASRAGPRAAGTDGSDFQHRE 

RVAMHYQMSVTLKYEIKKLIYVHLVIWLLLV 

AKMSVGHLRLLSHDQVAMPYQWEYPYLLSI 

LPSLLGLLSFPRNNISYLVLSMISMGLFS1APLI 

YGSMEMFPAAQQLYRHGKAYRFLFGFSAVSI 

MYLVLVLAVQVHAWQLYYSKKLLDSWFTST 

QEKKHK 


1346 


2696 


A 


10398 


1 


718 


" DDFVRCGPQSAAMGASARLLRAVIMGAPGS 
nvr nr V s TTTH FELKHLSSGDLLRDNMLRGT 
EIGVLAKAFIDQGKLIPDDVMTRLALHELKNL 
TQYSWLLDGFPRTLPQAEALDRAYQIDTVINL 
NVPFEVIKQRLTARWIHPASGRVYNIEFNPPK 
TVG1DDLTGEPLIQREDDKPETVTKRLKAYED 
QTKPVLEYYQKKGVLETFSGTETNKIWPYVY 
AFLQTKVPQRSQKASVTP 


1347 


1597 


A 


10402 


153 


1969 


KHRQENNALDMAPEIHMTGPMCLIENTNGEL 
VANPEALKJLSAITQPVVWAIVGLYRTGKSY 
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SEQ ID 

INU. 01 

nucl- 
eotide 
seq- 
uence 


SEQ ID 

INU. 01 

peptide 
seq- 
uence 


Met 
nou 


SEQ 

lU INU. 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIaninc OCystcine, 
D=Aspartic Acid, E— Glutamic Acid., 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q-Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, W=Tryp t0 Phan, 
Y-Tyrosine, X- Unknown, *=^Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














LMNKLAGKNKGFSLGSTYKSHTKGIWMWCV 

PHPKKPEHTLVLLDTEGLGDVKKGDNQNDS 

WIFTLAVLLSSTLVYNSMGTINQQAMDQL^ 

VTELTHRIRSKSSPDENENEDSADFVSFFPDFV 

WTLRDFSLDLEADGQPLTPDEYLEYSLKLTQ 

Gl'SQKI)iaslFNLPlU-ClRKFFPKJCKCFVFDLPI 

HRRKLAQLEKLQDEELDPEFVQQVADFCSYI 

FSNSKTKTLSGGIKVNGPRLESLVLTYINAISR 

GDLPCMENAVLALAQIENSAAVQKAIAHYD 

^^MU^KV^LrAb 1 LQbLLDLHRVbhREATEV 

Yl^NSFKX)VDfILFQKXLAAQLDKKRDDFCK 

QNQEASSDRCSALLQVIFSPLEEEVKAGIYSK 

PGGYCLFIQKLQDLEKKYYEEPRKGIQAEEIL 

QTYLKSKESVTDAILQTDQILTEKEKEIEVEC 

VKAESAQASAKMVEEMQIKYQQMMEEKEKS 

YQEHVKQLTEKMERERAQLLEEQEKTLTSKL 

QEQARVLKERCQGESTQLQNE1QKLQKTLKK 

KTKRYMSHKLKI 


1348 


2698 


A 


10404 


5 


892 


TQLPAPLSGVLSRLQLGSGAPLLTWVQETAG 

VAGGAPRRJtTPVTMWRLLARASAPLLRVPLS 

DSWALLPASAGVKTLLPVPSFEDVSIPEKPKL 

RFIERAPLVPKVRREPKNLSD1RGPSTEATEFT 

EGNFAILALGGGYLHWGHFEMMRLTINRSM 

DPKNMFAIWRVPAPFKPITRKSVGHRMGGGK 

GAIDHYVTPVKAGRLVVEMGGRCEFEEVQG 

FLDQVAIIIvLPFAAKAVSRGTLEKMRKDQEE 

RERNNQNPWTFER1ATANMLG1RKVLSPYDL 

THKGKYWGKFYMPKRV 


1349 


2699 


A 


10409 


59 


1184 


LRRNCSALGGLFQTUSDMKGSYPVWEDF1NK 

AGKLQSQLRTTVVAAAAFLDAFQKVADMAT 

NTRGGTREIGSALTRMCMRHRSIEAKLRQFSS 

ALIDCLINPLQEQMEE WKK V ANQLDKDH AK 

EYKKARQEIKJ^SSDTLKXQKKj^lKKGRGDIQ 

PQLDS ALQD VNDKYL LLEETEKQ A VRKAEIE 

ERGRFCTFISMLRPVIEEEISMLGEITHLQTISE 

DLKSLTMDPHKLPSSSEQVILDLKGSDYSWS 

YOTPPSSPSTTMSRKSSVCSSLN^VNSSDSRSS 

GSHSHSPSSHYRYRSSNLAQQAPVRLSSVSSH 

DSGFISQDAFQSKSPSPMPPEAPNQRRKEKRE 

PDPNGGGPTTASGPPAAAEEAQRPRSM 


1350 


2700 


A 


10410 


511 


958 


AGRGGPGKPVSWSSGPGSPGQTQRRSWVKST 

RGHSSLLPPSQDFVAGLSVILRGTVDDRLNW 

AFNLYDLNKDGCITKEEMLDIMKSIYDMMG 

KYTYPALREEAPREHVESFFQKMDRNKDGV 

VTIEEFIESCQKDENIMRSMQLFDNVI 
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WHAT IS CLAIMED IS: 
1 An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-1350, a mature protein coding portion of SEQ ID NO: 1-1350, an 
active domain of SEQ ID NO: 1-1350, and complementary sequences thereof. 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

3 An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 

7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively 
associated with a regulatory sequence that modulates expression of the polynucleotide in the host 
cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent conditions 
with any one of SEQ ID NO: 1-1350. 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 

12. An antibody directed against the polypeptide of claim 10. 
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13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polynucleotide of claim 1 for a period sufficient to form the complex; and 

b) detecting the complex, so that if a complex is detected, the polynucleotide 
of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1 ; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

sample. 



15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 



17. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 

a) contacting the compound with the polypeptide of claim 1 0 under 
conditions sufficient to form a polypeptide/compound complex; and 

b) detecting the complex, so that if the polypeptide/compound complex is 
detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound diat binds to the polypeptide of claim 1 0, 
comprising: 
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a) contacting the compound with the polypeptide of claim 1 0, in a cell, under 
conditions sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 
of claim 10 is identified. 

19. A method of producing the polypeptide of claim 1 0, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected from 
the group consisting of a polynucleotide sequence of SEQ ID NO: 1-1350, a mature protein 
coding portion of SEQ ID NO: 1-1350, an active domain of SEQ ID NO: 1-1350, 
complementary sequences thereof and a polynucleotide sequence hybridizing under stringent 
conditions to SEQ ID NO: 1-1350, under conditions sufficient to express the polypeptide in said 
cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1351-2700, the mature protein portion thereof, or the active domain 
thereof. 

21 . The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide array. 

22. A collection of polynucleotides, wherein the collection comprises the sequence 
information of at least one of SEQ ID NO: 1-1350. 

23 . The collection of claim 22, wherein the collection is provided on a nucleic acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of the 
polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any one of the 
polynucleotides in the collection. 

26. The collection of claim 22, wherein the collection is provided in a computer-readable 
format. 
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27. A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition comprising a polypeptide of claim 10 or 20 and a 
pharmaceutical ly acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition comprising an antibody that specifically binds to a 
polypeptide of claim 10 or 20 and a pharmaceutical!}' acceptable carrier. 
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